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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Artavanis-Tsakonas , S. et al. 



(ii) TITLE OF INVENTION: Therapeutic And Diagnostic Methods 
And Compositions Based On Notch Proteins And 
Nucleic Acids 

(iii) NUMBER OF SEQUENCES: 21 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10036 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

( C ) OPERATING SYSTEM : PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To be assigned 

(B) FILING DATE : On even date 

(C) CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Misrock, S. Leslie 

.(B) REGISTRATION NUMBER: 18,872 

(C) REFERENCE / DOCKET NUMBER: 7326-018 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212 790-9090 

(B) TELEFAX: 212 8698864/9741 

(C) TELEX: 66141 PENNIE 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2892 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 142.. 2 640 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAATTCGGAG GAATTATTCA AAACATAAAC ACAATAAACA ATTTGAGTAG TTGCCGCACA 60 

CACACACACA CACAGCCCGT GGATTATTAC ACTAAAAGCG ACACTCAATC CAAAAAATCA 120 

GCAACAAAAA CATCAATAAA C ATG CAT TGG ATT AAA TGT TTA TTA ACA GCA 171 
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Met His Trp He Lys Cys Leu Leu Thr Ala 
1 5 10 

TTC ATT TGC TTC ACA GTC ATC GTG CAG GTT CAC AGT TCC GGC AGC TTT 219 
Phe He Cys Phe Thr Val lie Val Gin Val His Ser Ser Gly Ser Phe 
15 20 25 

GAG TTG CGC CTG AAG TAC TTC AGC AAC GAT CAC GGG CGG GAC AAC GAG 267 
Glu Leu Arg Leu Lys Tyr Phe Ser Asn Asp His Gly Arg Asp Asn Glu 
30 35 40 

GGT CGC TGC TGC AGC GGG GAG TCG GAC GGA GCG ACG GGC AAG TGC CTG 315 
Gly Arg Cys Cys Ser Gly Glu Ser Asp Gly Ala Thr Gly Lys Cys Leu 
45 50 55 

GGC AGC TGC AAG ACG CGG TTT CGC GTC TGC CTA AAG CAC TAC CAG GCC 363 
Gly Ser Cys Lys Thr Arg Phe Arg Val Cys Leu Lys His Tyr Gin Ala 
60 * 65 70 

ACC ATC GAC ACC ACC TCC CAG TGC ACC TAC GGG GAC GTG ATC ACG CCC 411 
Thr lie Asp Thr Thr Ser Gin Cys Thr Tyr Gly Asp Val lie Thr Pro 
75 80 85 90 

ATT CTC GGC GAG AAC TCG GTC AAT CTG ACC GAC GCC CAG CGC TTC CAG 459 
lie Leu Gly Glu Asn Ser Val Asn Leu Thr Asp Ala Gin Arg Phe Gin 
95 100 105 

AAC AAG GGC TTC ACG AAT CCC ATC CAG TTC CCC TTC TCG TTC TCA TGG 507 
Asn Lys Gly Phe Thr Asn Pro He Gin Phe Pro Phe Ser Phe Ser Trp 
110 115 120 

CCG GGT ACC TTC TCG CTG ATC GTC GAG GCC TGG CAT GAT ACG AAC AAT 555 
Pro Gly Thr Phe Ser Leu lie Val Glu Ala Trp His Asp Thr Asn Asn 
125 130 135 

AGC GGC AAT GCG CGA ACC AAC AAG CTC CTC ATC CAG CGA CTC TTG GTG 603 
Ser Gly Asn Ala Arg Thr Asn Lys Leu Leu He Gin Arg Leu Leu Val 
140 145 150 

CAG CAG GTA CTG GAG GTG TCC TCC GAA TGG AAG ACG AAC AAG TCG GAA 651 
Gin Gin Val Leu Glu Val Ser Ser Glu Trp Lys Thr Asn Lys Ser Glu 
155 160 165 170 

TCG CAG TAC ACG TCG CTG GAG TAC GAT TTC CGT GTC ACC TGC GAT CTC 699 
Ser Gin Tyr Thr Ser Leu Glu Tyr Asp Phe Arg Val Thr Cys Asp Leu 
175 180 185 

AAC TAC TAC GGA TCC GGC TGT GCC AAG TTC TGC CGG CCC CGC GAC GAT 747 
Asn Tyr Tyr Gly Ser Gly Cys Ala Lys Phe Cys Arg Pro Arg Asp Asp 
190 195 200 

TCA TTT GGA CAC TCG ACT TGC TCG GAG ACG GGC GAA ATT ATC TGT TTG 795 
Ser Phe Gly His Ser Thr Cys Ser Glu Thr Gly Glu He He Cys Leu 
205 210 215 

ACC GGA TGG CAG GGC GAT TAC TGT CAC ATA CCC AAA TGC GCC AAA GGC 843 
Thr Gly Trp Gin Gly Asp Tyr Cys His He Pro Lys Cys Ala Lys Gly 
220 225 230 

TGT GAA CAT GGA CAT TGC GAC AAA CCC AAT CAA TGC GTT TGC CAA CTG 891 
Cys Glu His Gly His Cys Asp Lys Pro Asn Gin Cys Val Cys Gin Leu 
235 240 * 245 250 

GGC TGG AAG GGA GCC TTG TGC AAC GAG TGC GTT CTG GAA CCG AAC TGC 939 
Gly Trp Lys Gly Ala Leu Cys Asn Glu Cys Val Leu Glu Pro Asn Cys 
255 260 265 
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ATC CAT GGC ACC TGC AAC AAA CCC TGG ACT TGC ATC TGC AAC GAG GGT 
He His Gly Thr Cys Asn Lys Pro Trp Thr Cys He Cys Asn Glu Gly 
270 275 280 



987 



TGG GGA GGC TTG TAC TGC AAC CAG GAT CTG AAC TAC TGC ACC AAC CAC 
Trp Gly Gly Leu Tyr Cys Asn Gin Asp Leu Asn Tyr Cys Thr Asn His 
285 290 295 



1035 



AGA CCC TGC AAG AAT GGC GGA ACC TGC TTC AAC ACC GGC GAG GGA TTG 
Arg Pro Cys Lys Asn Gly Gly Thr Cys Phe Asn Thr Gly Glu Gly Leu 
300 305 310 



1083 



TAC ACA TGC AAA TGC GCT CCA GGA TAC AGT GGT GAT GAT TGC GAA AAT 1131 
Tyr Thr Cys Lys Cys Ala Pro Gly Tyr Ser Gly Asp Asp Cys Glu Asn 
315 320 325 330 

GAG ATC TAC TCC TGC GAT GCC GAT GTC AAT CCC TGC CAG AAT GGT GGT 1179 
Glu He Tyr Ser Cys Asp Ala Asp Val Asn Pro Cys Gin Asn Gly Gly 
335 340 345 



ACC TGC ATC GAT GAG CCG CAC ACA AAA ACC GGC TAC AAG TGT CAT TGC 
Thr Cys lie Asp Glu Pro His Thr Lys Thr Gly Tyr Lys Cys His Cys 
350 355 360 



1227 



GCC AAC GGC TGG AGC GGA AAG ATG TGC GAG GAG AAA GTG CTC ACG TGT 1275 
Ala Asn Gly Trp Ser Gly Lys Met Cys Glu Glu Lys Val Leu Thr Cys 
365 370 375 

TCG GAC AAA CCC TGT CAT CAG GGA ATC TGC CGC AAC GTT CGT CCT GGC 1323 
Ser Asp Lys Pro Cys His Gin Gly lie Cys Arg Asn Val Arg Pro Gly 
380 385 390 

TTG GGA AGC AAG GGT CAG GGC TAC CAG TGC GAA TGT CCC ATT GGC TAC 1371 
Leu Gly Ser Lys Gly Gin Gly Tyr Gin Cys Glu Cys Pro lie Gly Tyr 
395 400 405 410 

AGC GGA CCC AAC TGC GAT CTC CAG CTG GAC AAC TGC AGT CCG AAT CCA 1419 
Ser Gly Pro Asn Cys Asp Leu Gin Leu Asp Asn Cys Ser Pro Asn Pro 
415 420 425 

TGC ATA AAC GGT GGA AGC TGT CAG CCG AGC GGA AAG TGT ATT TGC CCA 1467 
Cys He Asn Gly Gly Ser Cys Gin Pro Ser Gly Lys Cys He Cys Pro 
430 435 440 

GCG GGA TTT TCG GGA ACG AGA TGC GAG ACC AAC ATT GAC GAT TGT CTT 1515 
Ala Gly Phe Ser Gly Thr Arg Cys Glu Thr Asn He Asp Asp Cys Leu 
445 450 455 



GGC CAC CAG TGC GAG AAC GGA GGC ACC TGC ATA GAT ATG GTC AAC CAA 
Gly His Gin Cys Glu Asn Gly Gly Thr Cys He Asp Met Val Asn Gin 
460 465 470 



1563 



TAT GGC TGC CAA TGC GTT CCC GGT TTC CAT GGC ACC CAC TGT AGT AGC 
Tyr Arg Cys Gin Cys Val Pro Gly Phe His Gly Thr His Cys Ser Ser 
475 480 485 490 



1611 



AAA GTT GAC TTG TGC CTC ATC AGA CCG TGT GCC AAT GGA GGA ACC TGC 
Lys Val Asp Leu Cys Leu lie Arg Pro Cys Ala Asn Gly Gly Thr Cys 
495 500 * 505 



1659 



TTG AAT CTC AAC AAC GAT TAC CAG TGC ACC TGT CGT GCG GGA TTT ACT 
Leu Asn Leu Asn Asn Asp Tyr Gin Cys Thr Cys Arg Ala Gly Phe Thr 
510 515 520 



1707 



GGC AAG GAT TGC TCT GTG GAC ATC GAT GAG TGC AGC AGT GGA CCC TGT 
Gly Lys Asp Cys Ser Val Asp He Asp Glu Cys Ser Ser Gly Pro Cys 
525 530 535 



1755 
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CAT AAC GGC GGC ACT TGC ATG AAC CGC GTC AAT TCG TTC GAA TGC GTG 1803 
His Asn Gly Gly Thr Cys Met Asn Arg Val Asn Ser Phe Glu Cys Val 
540 " 545 550 

TGT GCC AAT GGT TTC AGG GGC AAG CAG TGC GAT GAG GAG TCC TAC GAT 1851 
Cys Ala Asn Gly Phe Arg Gly Lys Gin Cys Asp Glu Glu Ser Tyr Asp 
555 560 565 570 

TCG GTG ACC TTC GAT GCC CAC CAA TAT GGA GCG ACC ACA CAA GCG AGA 1899 
Ser Val Thr Phe Asp Ala His Gin Tyr Gly Ala Thr Thr Gin Ala Arg 
575 580 585 

GCC GAT GGT TTG ACC AAT GCC CAG GTA GTC CTA ATT GCT GTT TTC TCC 1947 
Ala Asp Gly Leu Thr Asn Ala Gin Val Val Leu He Ala Val Phe Ser 
590 595 600 

GTT GCG ATG CCT TTG GTG GCG GTT ATT GCG GCG TGC GTG GTC TTC TGC 1995 
Val Ala Met Pro Leu Val Ala Val He Ala Ala Cys Val Val Phe Cys 
605 610 615 

ATG AAG CGC AAG CGT AAG CGT GCT CAG GAA AAG GAC GAC GCG GAG GCC 2043 
Met Lys Arg Lys Arg Lys Arg Ala Gin Glu Lys Asp Asp Ala Glu Ala 
620 625 630 

AGG AAG CAG AAC GAA CAG AAT GCG GTG GCC ACA ATG CAT CAC AAT GGC 2091 
Arg Lys Gin Asn Glu Gin Asn Ala Val Ala Thr Met His His Asn Gly 
635 640 645 650 

AGT GGG GTG GGT GTA GCT TTG GCT TCA GCC TCT CTG GGC GGC AAA ACT 2139 
Ser Gly Val Gly Val Ala Leu Ala Ser Ala Ser Leu Gly Gly Lys Thr 
655 660 665 

GGC AGC AAC AGC GGT CTC ACC TTC GAT GGC GGC AAC CCG AAT ATC ATC 2187 
Gly Ser Asn Ser Gly Leu Thr Phe Asp Gly Gly Asn Pro Asn lie He 
670 675 680 

AAA AAC ACC TGG GAC AAG TCG GTC AAC AAC ATT TGT GCC TCA GCA GCA 2235 
Lys Asn Thr Trp Asp Lys v^er Val Asn Asn He Cys Ala Ser Ala Ala 
685 & 690 695 

GCA GCG GCG GCG GCG GCA GCA GCG GCG GAC GAG TGT CTC ATG TAC GGC 2283 
Ala Ala Ala Ala Ala Ala Ala Ala Ala Asp Glu Cys Leu Met Tyr Gly 
700 705 710 

GGA TAT GTG GCC TCG GTG GCG GAT AAC AAC AAT GCC AAC TCA GAC TTT 2331 
Gly Tyr Val Ala Ser Val Ala Asp Asn Asn Asn Ala Asn Ser Asp Phe 
715 720 725 730 

TGT GTG GCT CCG CTA CAA AGA GCC AAG TCG CAA AAG CAA CTC AAC ACC 2379 
Cys Val Ala Pro Leu Gin Arg Ala Lys Ser Gin Lys Gin Leu Asn Thr 
735 740 745 

GAT CCC ACG CTC ATG CAC CGC GGT TCG CCG GCA GGC AGC TCA GCC AAG 2427 
Asp Pro Thr Leu Met His Arg Gly Ser Pro Ala Gly Ser Ser Ala Lys 
750 755 760 

GGA GCG TCT GGC GGA GGA CCG GGA GCG GCG GAG GGC AAG AGG ATC TCT 2475 
Gly Ala Ser Gly Gly Gly Pro Gly Ala Ala Glu Gly Lys Arg He Ser 
765 770 775 

GTT TTA GGC GAG GGT TCC TAC TGT AGC CAG CGT TGG CCC TCG TTG GCG 2523 
Val Leu Gly Glu Gly Ser Tyr Cys Ser Gin Arg Trp Pro Ser Leu Ala 
780 785 790 

GCG GCG GGA GTG GCC GGA GCC TGT TCA TCC CAG CTA ATG GCT GCA GCT 2571 
Ala Ala Gly Val Ala Gly Ala Cys Ser Ser Gin Leu Met Ala Ala Ala 
795 ' 800 805 810 
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TC6 GCA GCG GGC AGC GGA GCG GGG ACG GCG CAA CAG CAG CGA TCC GTG 2619 
Ser Ala Ala Gly Ser Gly Ala Gly Thr Ala Gin Gin Gin Arg Ser Val 
815 820 825 

GTC TGC GGC ACT CCG CAT ATG TAACTCCAAA AATCCGGAAG GGCTCCTGGT 2670 
Val Cys Gly Thr Pro His Met 
830 

AAATCCGGAG AAATCCG CAT GGAGGAGCTG ACAGCACATA CACAAAGAAA AGACTGGGTT 2730 

GGGTTCAAAA TGTGAGAGAG ACGCCAAAAT GTTGTTGTTG ATTGAAGCAG TTTAGTCGTC 2790 

ACG AAAAATG AAAAATCTGT AACAGGCATA ACTCGTAAAC TCCCTAAAAA ATTTGTATAG 2850 

TAATTAGCAA AG CTGTG ACC CAGCCGTTTC GATCCCGAAT TC 2892 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met His Trp lie Lys Cys Leu Leu Thr Ala Phe lie Cys Phe Thr Val 
15 10 15 

lie Val Gin Val His Ser Ser Gly Ser Phe Glu Leu Arg Leu Lys Tyr 
20 25 30 

Phe Ser Asn Asp His Gly Arg Asp Asn Glu Gly Arg Cys Cys Ser Gly 
35 40 45 

Glu Ser Asp Gly Ala Thi^ Gly Lys Cys Leu Gly Ser Cys Lys Thr Arg 
50 55 60 

Phe Arg Val Cys Leu Lys His Tyr Gin Ala Thr lie Asp Thr Thr Ser 
65 70 75 80 

Gin Cys Thr Tyr Gly Asp Val lie Thr Pro lie Leu Gly Glu Asn Ser 
85 90 95 

Val Asn Leu Thr Asp Ala Gin Arg Phe Gin Asn Lys Gly Phe Thr Asn 
100 105 110 

Pro lie Gin Phe Pro Phe Ser Phe Ser Trp Pro Gly Thr Phe Ser Leu 
115 120 125 

lie Val Glu Ala Trp His Asp Thr Asn Asn Ser Gly Asn Ala Arg Thr 
130 135 140 

Asn Lys Leu Leu He Gin Arg Leu Leu Val Gin Gin Val Leu Glu Val 
145 150 155 160 

Ser Ser Glu Trp Lys Thr Asn Lys Ser Glu Ser Gin Tyr Thr Ser Leu 
165 ^ 170 175 

Glu Tyr Asp Phe Arg Val Thr Cys Asp Leu Asn Tyr Tyr Gly Ser Gly 
180 ' 185 190 

Cys Ala Lys Phe Cys Arg Pro Arg Asp Asp Ser Phe Gly His Ser Thr 
195 200 205 
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Cys Ser Glu Thr Gly Glu lie He Cys Leu Thr Gly Trp Gin Gly Asp 
210 215 220 

Tyr Cys His He Pro Lys Cys Ala Lys Gly Cys Glu His Gly His Cys 
225 230 235 240 

Asp Lys Pro Asn Gin Cys Val Cys Gin Leu Gly Trp Lys Gly Ala Leu 
245 250 255 

Cys Asn Glu Cys Val Leu Glu Pro Asn Cys He His Gly Thr Cys Asn 
260 265 270 

Lys Pro Trp Thr Cys He Cys Asn Glu Gly Trp Gly Gly Leu Tyr Cys 
275 280 285 

Asn Gin Asp Leu Asn Tyr Cys Thr Asn His Arg Pro Cys Lys Asn Gly 
290 295 300 

Gly Thr Cys Phe Asn Thr Gly Glu Gly Leu Tyr Thr Cys Lys Cys Ala 
305 310 315 320 

Pro Gly Tyr Ser Gly Asp Asp Cys Glu Asn Glu He Tyr Ser £ys Asp 
325 330 335 

Ala Asp Val Asn Pro Cys Gin Asn Gly Gly Thr Cys He Asp Glu Pro 
340 345 350 

His Thr Lys Thr Gly Tyr Lys Cys His Cys Ala Asn Gly Trp Ser Gly 
355 " 360 365 

Lys Met Cys Glu Glu Lys Val Leu Thr Cys Ser Asp Lys Pro Cys His 
370 375 380 

Gin Gly He Cys Arg Asn Val Arg Pro Gly Leu Gly Ser Lys Gly Gin 
385 390 395 400 

Gly Tyr Gin Cys Glu Cys Pro He Gly Tyr Ser Gly Pro Asn Cys Asp 
405 410 415 

Leu Gin Leu Asp Asn Cys Ser Pro Asn Pro Cys He Asn Gly Gly Ser 
420 425 430 

Cys Gin Pro Ser Gly Lys Cys He Cys Pro Ala Gly Phe Ser Gly Thr 
435 440 445 

Arg Cys Glu Thr Asn He Asp Asp Cys Leu Gly His Gin Cys Glu Asn 
450 455 460 

Gly Gly Thr Cys He Asp Met Val Asn Gin Tyr Arg Cys Gin Cys Val 
465 470 475 480 

Pro Gly Phe His Gly Thr His Cys Ser Ser Lys Val Asp Leu Cys Leu 
485 490 495 

He Arg Pro Cys Ala Asn Gly Gly Thr Cys Leu Asn Leu Asn Asn Asp 
500 505 510 

Tyr Gin Cys Thr Cys Arg Ala Gly Phe Thr Gly Lys Asp Cys Ser Val 
515 520 525 

Asp He Asp Glu Cys Ser Ser Gly Pro Cys His Asn Gly Gly Thr Cys 
530 535 540 

Met Asn Arg Val Asn Ser Phe Glu Cys Val Cys Ala Asn Gly Phe Arg 
545 550 555 560 

Gly Lys Gin Cys Asp Glu Glu Ser Tyr Asp Ser Val Thr Phe Asp Ala 
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565 570 575 

His Gin Tyr Gly Ala Thr Thr Gin Ala Arg Ala Asp Gly Leu Thr Asn 
580 585 590 

Ala Gin Val Val Leu He Ala Val Phe Ser Val Ala Met Pro Leu Val 
595 600 605 

Ala Val lie Ala Ala Cys Val Val Phe Cys Met Lys Arg Lys Arg Lys 
610 615 620 

Arg Ala Gin Glu Lys Asp Asp Ala Glu Ala Arg Lys Gin Asn Glu Gin 
625 630 635 640 

Asn Ala Val Ala Thr Met His His Asn Gly Ser Gly Val Gly Val Ala 
645 650 655 

Leu Ala Ser Ala Ser Leu Gly Gly Lys Thr Gly Ser Asn Ser Gly Leu 
660 665 670 

Thr Phe Asp Gly Gly Asn Pro Asn lie lie Lys Asn Thr Trp Asp Lys 
675 680 685 

Ser Val Asn Asn lie Cys Ala Ser Ala Ala Ala Ala Ala Ala Ala Ala 
690 695 700 

Ala Ala Ala Asp Glu Cys Leu Met Tyr Gly Gly Tyr Val Ala Ser Val 
705 710 715 720 

Ala Asp Asn Asn Asn Ala Asn Ser Asp Phe Cys Val Ala Pro Leu Gin 
725 730 735 

Arg Ala Lys Ser Gin Lys Gin Leu Asn Thr Asp Pro Thr Leu Met His 
740 745 750 

Arg Gly Ser Pro Ala Gly Ser Ser Ala Lys Gly Ala Ser Gly Gly Gly 
755 760 765 

Pro Gly Ala Ala Glu Gly Lys Arg He Ser Val Leu Gly Glu Gly Ser 
770 775 780 

Tyr Cys Ser Gin Arg Trp Pro Ser Leu Ala Ala Ala Gly Val Ala Gly 
785 790 795 800 

Ala Cys Ser Ser Gin Leu Met Ala Ala Ala Ser Ala Ala Gly Ser Gly 
805 810 815 

Ala Gly Thr Ala Gin Gin Gin Arg Ser Val Val Cys Gly Thr Pro His 
820 825 830 

Met 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 442.. 1320 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCGAGTCGAG CGCCGTGCTT CGAGCGGTGA TGAGCCCCTT TTCTGTCAAC GCTAAAGATC 60 

TACAAAACAT CAGCGCCTAT CAAGTGGAAG TGTCAAGTGT GAACAAAACA AAAACGAGAG 120 

AAGCACATAC TAAGGTCCAT ATAAATAATA AATAATAATT GTGTGTGATA ACAACATTAT 180 

CCAAACAAAA CCAAACAAAA CGAAGGCAAA GTGGAGAAAA TGATACAGCA TCCAGAGTAC 240 

GGCCGTTATT CAGCTATCCA GAGCAAGTGT AGTGTGGCAA AATAGAAACA AACAAAGGCA 300 

CCAAAATCTG CATACATGGG CTAATTAAGG CTGCCCAGCG AATTTACATT TGTGTGGTGC 360 

CAATCCAGAG TGAATCCGAA ACAAACTCCA TCTAGATCGC CAACCAGCAT CACGCTCGCA 420 

AACGCCCCCA GAATGTACAA A ATG TTT AGG AAA CAT TTT CGG CGA AAA CCA 471 

Met Phe Arg Lys His Phe Arg Arg Lys Pro 
1 5 10 



GCT ACG TCG TCG TCG TTG GAG TCA ACA ATA GAA TCA GCA GAC AGC CTG 519 

Ala Thr Ser Ser Ser Leu Glu Ser Thr lie Glu Ser Ala Asp Ser Leu 

15 20 25 

GGA ATG TCC AAG AAG ACG GCG ACA AAA AGG CAG CGT CCG AGG CAT CGG 567 

Gly Met Ser Lys Lys Thr Ala Thr Lys Arg Gin Arg Pro Arg His Arg 

30 35 40 

GTA CCC AAA ATC GCG ACC CTG CCA TCG ACG ATC CGC GAT TGT CGA TCA 615 

Val Pro Lys lie Ala Thr Leu Pro Ser Thr He Arg Asp Cys Arg Ser 

45 50 55 

TTA AAG TCT GCC TGC AAC TTA ATT GCT TTA ATT TTA ATA CTG TTA GTC 663 

Leu Lys Ser Ala Cys Asn Leu He Ala Leu He Leu lie Leu Leu Val 

60 65 70 

CAT AAG ATA TCC GCA GCT GGT ARC TTC GAG CTG GAA ATA TTA GAA ATC 711 

His Lys lie Ser Ala Ala Gly Asn Phe Glu Leu Glu He Leu Glu lie 

75 80 85 90 

TCA AAT ACC AAC AGC CAT CTA CTC AAC GGC TAT TGC TGC GGC ATG CCA 759 

Ser Asn Thr Asn Ser His Leu Leu Asn Gly Tyr Cys Cys Gly Met Pro 

95 100 105 

GCG GAA CTT AGG GCC ACC AAG ACG ATA GGC TGC TCG CCA TGC ACG ACG 807 

Ala Glu Leu Arg Ala Thr Lys Thr He Gly Cys Ser Pro Cys Thr Thr 

HO 115 120 

GCA TTC CGG CTG TGC CTG AAG GAG TAC CAG ACC ACG GAG CAG GGT GCC 855 

Ala Phe Arg Leu Cys Leu Lys Glu Tyr Gin Thr Thr Glu Gin Gly Ala 

125 130 135 

AGC ATA TCC ACG GGC TGT TCG TTT GGC AAC GCC ACC ACC AAG ATA CTG 903 

Ser He Ser Thr Gly Cys Ser Phe Gly Asn Ala Thr Thr Lys He Leu 

140 145 150 

GGT GGC TCC AGC TTT GTG CTC AGC GAT CCG GGT GTG GGA GCC ATT GTC 951 
Gly Gly Ser S r Phe Val Leu ser Asp Pro Gly Val Gly Ala He Val 

155 160 165 17U 

CTG CCC TTT ACG TTT CGT TGG ACG AAG TCG TTT ACG CTG ATA CTG CAG 999 

Leu Pro Phe Thr Phe Arg Trp Thr Lys Ser Phe Thr Leu II Leu Gin 

175 180 185 

GCG TTG GAT ATG TAC AAC ACA TCC TAT CCA GAT GCG GAG AGG TTA ATT 1047 
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Ala Leu Asp Met Tyr Asn Thr Ser Tyr Pro Asp Ala Glu Arg Leu lie 
190 195 200 

GAG GAA ACA TCA TAC TCG GGC GTG ATA CTG CCG TCG CCG GAG TGG AAG 1095 
Glu Glu Thr Ser Tyr Ser Gly Val lie Leu Pro Ser Pro Glu Trp Lys 
205 210 215 

ACG CTG GAC CAC ATC GGG CGG AAC GCG CGG ATC ACC TAC CGT GTC CGG 1143 
Thr Leu Asp His lie Gly Arg Asn Ala Arg He Thr Tyr Arg Val Arg 
220 225 230 

GTG CAA TGC GCC GTT ACC TAC TAC AAC ACG ACC TGC ACG ACC TTC TGC 1191 
Val Gin Cys Ala Val Thr Tyr Tyr Asn Thr Thr Cys Thr Thr Phe Cys 
235 240 245 250 

CGT CCG CGG GAC GAT CAG TTC GGT CAC TAC GCC TGC GGC TCC GAG GGT 1239 
Arg Pro Arg Asp Asp Gin Phe Gly His Tyr Ala Cys Gly Ser Glu Gly 
255 260 265 

CAG AAG CTC TGC CTG AAT GGC TGG CAG GGC GTC AAC TGC GAG GAG GCC 1287 
Gin Lys Leu Cys Leu Asn Gly Trp Gin Gly Val Asn Cys Glu Glu Ala 
270 275 " 280 

ATA TGC AAG GCG GGC TGC GAC CCC GTC CAC GGC 1320 
He Cys Lys Ala Gly Cys Asp Pro Val His Gly 
285 290 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Phe Arg Lys His Phe Arg Arg Lys Pro Ala Thr Ser Ser Ser Leu 
1 5 10 15 

Glu Ser Thr lie Glu Ser Ala Asp Ser Leu Gly Met Ser Lys Lys Thr 
20 25 30 

Ala Thr Lys Arg Gin Arg Pro Arg His Arg Val Pro Lys lie Ala Thr 
35 40 45 

Leu Pro Ser Thr lie Arg Asp Cys Arg Ser Leu Lys Ser Ala Cys Asn 
50 55 60 

Leu lie Ala Leu lie Leu lie Leu Leu Val His Lys lie Ser Ala Ala 
65 70 75 80 

Gly Asn Phe Glu Leu Glu lie Leu Glu lie Ser Asn Thr Asn Ser His - 
85 90 95 

Leu Leu Asn Gly Tyr Cys Cys Gly Met Pro Ala Glu Leu Arg Ala Thr 
100 105 110 

Lys Thr lie Gly Cys Ser Pro Cys Thr Thr Ala Phe Arg Leu Cys Leu 
115 120 125 

Lys Glu Tyr Gin Thr Thr Glu Gin Gly Ala Ser lie Ser Thr Gly Cys 
130 135 140 

Ser Phe Gly Asn Ala Thr Thr Lys lie Leu Gly Gly Ser Ser Phe Val 
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155 "0 



Leu Ser Asp Pro Gly Val Gly Ala He Val Leu Pro Phe Thr Phe Arg 
Trp Thr Lys Ser Phe Thr Leu He Leu Gin Ala Leu Asp Met Tyr Asn 
Thr Ser Tyr Pro Asp Ala Glu Arg Leu He Glu Glu Thr Ser Tyr Ser 



195 



Gly val lie Leu Pro Ser Pro Glu Trp Lys Thr Leu Asp His He Gly 



210 



Arg Asn Ala Arg He Thr Tyr Arg Val Arg Val Gin Cys Ala Val Thr 
225 230 235 

Tyr Tyr Asn Thr Thr Cys Thr Thr Phe Cys Arg Pro Arg Asp Asp Gin 
245 

Phe Gly His Tyr Ala Cys Gly Ser Glu Gly Gin Lys Leu Cys Leu Asn 

260 265 
Gly Trp Gin Gly Val Asn Cys Glu Glu Ala He Cys Lys Ala Gly Cys 



275 

Asp Pro Val His Gly 
290 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGGTGGACTT CCTTCGTGTA TTGGTGGGAG CCCTCGGGAA CGGGGGGTAA CACTGAAAGG 
TCGAGTACCC ATTTCCGTCA TAACGGGTTG GTCGCCCCCT AGGGGTCGGA GTCAGGTGGA 
CGGGAGGTCG ACAACGCCCG GGGGACGGGT GGTACATGGT GTAAGGTCTT TACCGGACCG 
GGCAAACGGG TCACACCGAA AGGGGTGAAC GGTAACTACG GGGTCGTCCT GCCCGTCCAT 240 
CGAGTCTGGT AAGAGGGTCG CCTTAAG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doub le 

(D) TOPOLOGY s unknown 

(ii) MOLECULE TYPE: cDNA 



60 
120 
180 



267 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GAATTCCTTC CATTATACGT GACTTTTCTG AAACTG TAG C CACCCTAGTG TCTCTAACTC 60 

CCTCTGGAGT TTGTCAGCTT TGGTCTTTTC AAAGAGCAGG CTCTCTTCAA GCTCCTTAAT 120 

GCGGGCATGC TCCAGTTTGG TCTGCGTCTC AAGATCACCT TTGGTAATTG ATTCTTCTTC 180 

AACCCGGAAC TGAAGGCTGG CTCTCACCCT CTAGGCAGAG CAGGAATTCC GAGGTGGATG 240 

TGTTAGATGT GAATGTCCGT GGCCCAGATG GCTGCACCCC ATTGATGTTG GCTTCTCTCC 300 

GAGGAGGCAG CTCAGATTTG AGTGATGAAG ATGAAGATGC AGAGGACTGT TCTGCTAACA 360 

TCATCACAGA CTTGGTCTAC CAGGGTGCCA GCCTCCAGNC CAGACAGACC GGACTGGTGA 420 

GATGG CCCTG CACCTTGCAG CCCGCTACTC ACGGGCTGAT GCTGCCAAGC GTCTCCTGGA 480 

TGCAGGTGCA GATGCCAATG CCCAGGACAA CATGGGCCGC TGTCCACTCC ATGCTGCAGT 540 

GGCACGTGAT GCCAAGGTGT ATTCAGATCT GTTA 574 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TCCAGATTCT GATTCGCAAC CGAGTAACTG ATCTAGATGC CAGGATGAAT GATGGTACTA 60 

CACCCCTGAT CCTGGCTGCC CGCCTGGCTG TGGAGGGAAT GGTGGCAGAA CTGATCAACT 120 

GCCAAG CGGA TGTGAATGCA GTGGATGACC ATGGAAAATC TGCTCTTCAC TGGGCAGCTG 180 

CTGTCAATAA TGTGGAGGCA ACTCTTTTGT TGTTGAAAAA TGGGGCCAAC CGAGACATGC 240 

AGGACAACAA GGAAGAGACA CCTCTGTTTC TTGCTGCCCG GGAGGAGCTA TAAGC 295 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GAATTCCATT CAGGAGGAAA GGGTGGGGAG AGAAGCAGGC ACCCACTTTC CCGTGGCTGG 60 
ACTCGTTCCC AGGTGGCTCC ACCGGCAGCT GTGACCGCCG CAGGTGGGGG CGGAGTGCCA 120 
TTCAGAAAAT TCCAGAAAAG CCCTACCCCA ACTCGGACGG CAACGTCACA CCCGTGGGTA 180 
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GCAACTGGCA CACAAACAGC CAGCGTGTCT GGGGCACGGG GGGATGGCAC CCCCTGCAGG 240 

248 

CAGAGCTG 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TACGTATCTC GAGCACAGAC AGCTGACGTA CACTTTTNNA GTGCGAGGGA CATTCGTCCG 

ACCAGTACGA ACATTTAGGC TCAGTACGGT AGGTCCATGG CCAAGACTAG GAGACGTAGG 

GAGCTACAGG TCCCGCTCGC TAAACTCGGA CCACTGAAAC CTCCGGTCGA CAGTCGGTAA 

GCGAACAAGA GGGCCAGATC TTAGAGAAGG TGTCGCGGCG AGACTCGGGC TCGGGTCAGG 
CGGCCTTAAG GACGTCGGGC CCNNNAGGTG ATCAAGATCT CGNCNCGGCG GGCGCCACCT 
CGAGGNCGAA AACAAGGGAA ATC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..3234 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TGC CAG GAG GAC GCG GGC AAC AAG GTC TGC AGC CTG CAG TGC AAC AAC 
lys oil llu Asp Ala Gly Asn Lys Val Cys Ser Leu Gin Cys Asn Asn 
1 5 10 A 

CAC GCG TGC GGC TGG GAC GGC GGT GAC TGC TCC CTC AAC TTC AAT GAC 
S52 lit Ss 2J S Asp Gly Gly Asp cys Ser Leu Asn Phe Asn Asp 
20 25 

CCC TGG AAG AAC TGC ACG CAG TCT CTG CAG TGC TGG AAG TAC TTC AGT 
?ro Trl ££s !£n Cys Thr Gin Ser Leu Gin Cys Trp Lys Tyr Phe Ser 
35 40 

GAC GGC CAC TGT GAC AGC CAG TGC AAC TCA GCC GGC TGC CTC TTC GAC 
Sp tly £s Ss Asp Ser Gin Cys Asn Ser Ala Gly Cys Leu Phe Asp 

50 55 60 

GGC TTT GAC TGC CAG CGT GCG GAA GGC CAG TGC AAC CCC CTG TAC GAC 



60 
120 
180 
240 
300 
323 



48 



96 



144 



192 



240 
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Gly Phe Asp Cys Gin Arg Ala Glu Gly Gin Cys Asn Pro Leu Tyr Asp 
65 70 75 80 

CAG TAC TGC AAG GAC CAC TTC AGC GAC GGG CAC TGC GAC CAG GGC TGC 288 
Gin Tyr Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gin Gly Cys 
85 90 95 

AAC AGC GCG GAG TGC GAG TGG GAC GGG CTG GAC TGT GCG GAG CAT GTA 336 
Asn Ser Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala Glu His Val 
100 105 110 

CCC GAG AGG CTG GCG GCC GGC ACG CTG GTG GTG GTG GTG CTG ATG CCG 384 
Pro Glu Arg Leu Ala Ala Gly Thr Leu Val Val Val Val Leu Met Pro 
115 120 125 

CCG GAG CAG CTG CGC AAC AGC TCC TTC CAC TTC CTG CGG GAG CTC AGC 432 
Pro Glu Gin Leu Arg Asn Ser Ser Phe His Phe Leu Arg Glu Leu Ser 
130 135 140 

CGC GTG CTG CAC ACC AAC GTG GTC TTC AAG CGT GAC GCA CAC GGC CAG 480 
Arg Val Leu His Thr Asn Val Val Phe Lys Arg Asp Ala His Gly Gin 
145 150 155 160 

CAG ATG ATC TTC CCC TAC TAC GGC CGC GAG GAG GAG CTG CGC AAG CAC 528 
Gin Met lie Phe Pro Tyr Tyr Gly Arg Glu Glu Glu Leu Arg Lys His 
165 170 175 

CCC ATC AAG CGT GCC GCC GAG GGC TGG GCC GCA CCT GAC GCC CTG CTG 576 
Pro lie Lys Arg Ala Ala Glu Gly Trp Ala Ala Pro Asp Ala Leu Leu 
180 185 190 

GGC CAG GTG AAG GCC TCG CTG CTC CCT GGT GGC AGC GAG GGT GGG CGG 624 
Gly Gin Val Lys Ala Ser Leu Leu Pro. Gly Gly Ser Glu Gly Gly Arg 
195 200 205 

CGG CGG AGG GAG CTG GAC CCC ATG GAC GTC CGC GGC TCC ATC GTC TAC 672 
Arg Arg Arg Glu Leu Asp Pro Met Asp Val Arg Gly Ser He Val Tyr 
210 . 215 220 

CTG GAG ATT GAC AAC CGG CAG TGT GTG CAG GCC TCC TCG CAG TGC TTC 720 
Leu Glu He Asp Asn Arg Gin Cys Val Gin Ala Ser Ser Gin Cys Phe 
225 230 235 240 

CAG AGT GCC ACC GAC GTG GCC GCA TTC CTG GGA GCG CTC GCC TCG CTG 768 
Gin Ser Ala Thr Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser . Leu 
245 250 255 

GGC AGC CTC AAC ATC CCC TAC AAG ATC GAG GCC GTG CAG AGT GAG ACC 816 
Gly Ser Leu Asn lie Pro Tyr Lys lie Glu Ala Val Gin Ser Glu Thr 
260 265 270 

GTG GAG CCG CCC CCG CCG GCG CAG CTG CAC TTC ATG TAC GTG GCG GCG 864 
Val Glu Pro Pro Pro Pro Ala Gin Leu His Phe Met Tyr Val Ala Ala 
275 280 285 

GCC GCC TTT GTG CTT CTG TTC TTC GTG GGC TGC GGG GTG CTG CTG TCC 912 
Ala Ala Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser 
290 295 300 

CGC AAG CGC CGG CGG CAG CAT GGC CAG CTC TGG TTC CCT GAG GGC TTC 960 
Arg Lys Arg Arg Arg Gin His Gly Gin Leu Trp Phe Pro Glu Gly Phe 
305 ~ 310 315 320 

AAA GTG TCT GAG GCC AGC AAG AAG AAG CGG CGG GAG CCC CTC GGC GAG 1008 
Lys Val Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Pro Leu Gly Glu 
325 330 335 
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GAC TCC GTG GGC CTC AAG CCC CTG AAG AAC GCT TCA GAC GGT GCC CTC 1056 
Asp Ser Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp Gly Ala Leu 
340 345 350 

ATG GAC GAC AAC CAG AAT GAG TGG GGG GAC GAG GAC CTG GAG ACC AAG 1104 
Met Asp Asp Asn Gin Asn Glu Trp Gly Asp Glu Asp Leu Glu Thr Lys 
355 360 365 

AAG TTC CGG TTC GAG GAG CCC GTG GTT CTG CCT GAC CTG GAC GAC CAG 1152 
Lys Phe Arg Phe Glu Glu Pro Val Val Leu Pro Asp Leu Asp Asp Gin 
370 375 380 

ACA GAC CAC CGG CAG TGG ACT CAG CAG CAC CTG GAT GCC GCT GAC CTG 1200 
Thr Asp His Arg Gin Trp Thr Gin Gin His Leu Asp Ala Ala Asp Leu 
385 390 395 400 

CGC ATG TCT GCC ATG GCC CCC ACA CCG CCC CAG GGT GAG GTT GAC GCC 1248 
Arq Met Ser Ala Met Ala Pro Thr Pro Pro Gin Gly Glu Val Asp Ala 
405 410 415 

GAC TGC ATG GAC GTC AAT GTC CGC GGG CCT GAT GGC TTC ACC CCG CTC 1296 
Asp Cys Met Asp Val Asn Val Arg Gly Pro Asp Gly Phe Thr Pro Leu 
420 425 430 

ATG ATC GCC TCC TGC AGC GGG GGC GGC CTG GAG ACG GGC AAC AGC GAG 1344 
Met He Ala Ser Cys Ser Gly Gly Gly Leu Glu Thr Gly Asn Ser Glu 
435 440 445 

GAA GAG GAG GAC GCG CCG GCC GTC ATC TCC GAC TTC ATC TAC CAG GGC 1392 
Glu Glu Glu Asp Ala Pro Ala Val He Ser Asp Phe He Tyr Gin Gly 
450 ^ 455 460 

GCC AGC CTG CAC AAC CAG ACA GAC CGC ACG GGC GAG ACC GCC TTG CAC 1440 
Ala Ser Leu His Asn Gin Thr Asp Arg Thr Gly Glu Thr Ala Leu His 
465 470 475 480 

CTG GCC GCC CGC TAC TCA CGC TCT GAT GCC GCC AAG CGC CTG CTG GAG 1488 
Leu Ala Ala Arg Tyr Ser Arg Ser Asp Ala Ala Lys Arg Leu Leu Glu 
485 490 495 

GCC AGC GCA GAT GCC AAC ATC CAG GAC AAC ATG GGC CGC ACC CCG CTG 1536 
Ala Ser Ala Asp Ala Asn He Gin Asp Asn Met Gly Arg Thr Pro Leu 
500 505 510 

CAT GCG GCT GTG TCT GCC GAC GCA CAA GGT GTC TTC CAG ATC CTG ATC 1584 
His Ala Ala Val Ser Ala Asp Ala Gin Gly Val Phe Gin He Leu He 
515 520 525 

CGG AAC CGA GCC ACA GAC CTG GAT GCC CGC ATG CAT GAT GGC ACG ACG 1632 
Arg Asn Arg Ala Thr Asp Leu Asp Ala Arg Met His Asp Gly Thr Thr 
530 535 540 

CCA CTG ATC CTG GCT GCC CGC CTG GCC GTG GAG GGC ATG CTG GAG GAC 1680 
Pro Leu He Leu Ala Ala Arg Leu Ala Val Glu Gly Met Leu Glu Asp 
545 550 555 560- 

CTC ATC AAC TCA CAC GCC GAC GTC AAC GCC GTA GAT GAC CTG GGC AAG 1728 
Leu He Asn Ser His Ala Asp Val Asn Ala Val Asp Asp Leu Gly Lys 
565 570 575 

TCC GCC CTG CAC TGG GCC GCC GCC GTG AAC AAT GTG GAT GCC GCA GTT 1776 
Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn Val Asp Ala Ala Val 
580 585 590 

GTG CTC CTG AAG AAC GGG GCT AAC AAA GAT ATG CAG AAC AAC AGG GAG 1824 
Val Leu L u Lys Asn Gly Ala Asn Lys Asp Met Gin Asn Asn Arg Glu 
595 * 600 605 
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GAG ACA CCC CTG TTT CTG GCC GCC CGG GAG GGC AGC TAC GAG ACC GCC 1872 

Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala 

610 615 620 

AAG GTG CTG CTG GAC CAC TTT GCC AAC CGG GAC ATC ACG GAT CAT ATG 1920 
Lys Val Leu Leu Asp His Phe Ala Asn Arg Asp lie Thr Asp His Met 
625 630 635 640 

GAC CGC CTG CCG CGC GAC ATC GCA CAG GAG CGC ATG CAT CAC GAC ATC 1968 
Asp Arg Leu Pro Arg Asp lie Ala Gin Glu Arg Met His His Asp lie 
645 650 655 

GTG AGG CTG CTG GAC GAG TAC AAC CTG GTG CGC AGC CCG CAG CTG CAC 2016 
Val Arg Leu Leu Asp Glu Tyr Asn Leu Val Arg Ser Pro Gin Leu His 
660 665 670 

GGA GCC CCG CTG GGG GGC ACG CCC ACC CTG TCG CCC CCG CTC TGC TCG 2064 
Gly Ala Pro Leu Gly Gly Thr Pro Thr Leu Ser Pro Pro Leu Cys Ser 
675 ~ 680 685 

CCC AAC GGC TAC CTG GGC AGC CTC AAG CCC GGC GTG CAG GGC AAG AAG 2112 
Pro Asn Gly Tyr Leu Gly Ser Leu Lys Pro Gly Val Gin Gly Lys Lys 
690 ~ 695 700 

GTC CGC AAG CCC AGC AGC AAA GGC CTG GCC TGT GGA AGC AAG GAG GCC 2160 
Val Arg Lys Pro Ser Ser Lys Gly Leu Ala Cys Gly Ser Lys Glu Ala 
705 710 715 720 

AAG GAC CTC AAG GCA CGG AGG AAG AAG TCC CAG GAT GGC AAG GGC TGC 2208 
Lys Asp Leu Lys Ala Arg Arg Lys Lys Ser Gin Asp Gly Lys Gly Cys 
725 730 735 

CTG CTG GAC AGC TCC GGC ATG CTC TCG CCC GTG GAC TCC CTG GAG TCA 2256 
Leu Leu Asp Ser Ser Gly Met Leu Ser Pro Val Asp Ser Leu Glu Ser 
740 745 750 

CCC CAT GGC TAC CTG TCA GAC GTG GCC TCG CCG CCA CTG CTG CCC TCC 2304 
Pro His Gly Tyr Leu ServAsp Val Ala Ser Pro Pro Leu Leu Pro Ser 
755 760 765 

CCG TTC CAG CAG TCT CCG TCC GTG CCC CTC AAC CAC CTG CCT GGG ATG 2352 
Pro Phe Gin Gin Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met 
770 775 780 

CCC GAC ACC CAC CTG GGC ATC GGG CAC CTG AAC GTG GCG GCC AAG CCC 2400 
Pro Asp Thr His Leu Gly He Gly His Leu Asn Val Ala Ala Lys Pro 
785 790 795 800 

GAG ATG GCG GCG CTG GGT GGG GGC GGC CGG CTG GCC TTT GAG ACT GGC 2448 
Glu Met Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu Thr Gly 
805 810 815 

CCA CCT CGT CTC TCC CAC CTG CCT GTG GCC TCT GGC ACC AGC ACC GTC 2496 
Pro Pro Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr Ser Thr Val 
820 825 830 

CTG GGC TCC AGC AGC GGA GGG GCC CTG AAT TTC ACT GTG GGC GGG TCC 2544 
Leu Gly Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr Val Gly Gly Ser 
835 " 840 845 

ACC AGT TTG AAT GGT CAA TGC GAG TGG CTG TCC CGG CTG CAG AGC GGC 2592 
Thr Ser Leu Asn Gly Gin Cys Glu Trp Leu Ser Arg Leu Gin Ser Gly 
850 855 860 

ATG GTG CCG AAC CAA TAC AAC CCT CTG CGG GGG AGT GTG GCA CCA GGC 2640 
Met Val Pro Asn Gin Tyr Asn Pro Leu Arg Gly Ser Val Ala Pro Gly 
865 870 875 .880 
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CCC CTG AGC ACA CAG GCC CCC TCC CTG CAG CAT GGC ATG GTA GGC CCG 2688 
Pro Leu Ser Thr Gin Ala Pro Ser Leu Gin His Gly Met Val Gly Pro 
885 890 895 

CTG CAC AGT AGC CTT GCT GCC AGC GCC CTG TCC CAG ATG ATG AGC TAG 2736 
Leu His Ser Ser Leu Ala Ala Ser Ala Leu Ser Gin Met Met Ser Tyr 
900 905 910 

CAG GGC CTG CCC AGC ACC CGG CTG GCC ACC CAG CCT CAC CTG GTG CAG 2784 
Gin Gly Leu Pro Ser Thr Arg Leu Ala Thr Gin Pro His Leu Val Gin 
915 920 925 

ACC CAG CAG GTG CAG CCA CAA AAC TTA CAG ATG CAG CAG CAG AAC CTG 2832 
Thr Gin Gin Val Gin Pro Gin Asn Leu Gin Met Gin Gin Gin Asn Leu 
930 935 940 

CAG CCA GCA AAC ATC CAG CAG CAG CAA AGC CTG CAG CCG CCA CCA CCA 2880 
Gin Pro Ala Asn He Gin Gin Gin Gin Ser Leu Gin Pro Pro Pro Pro 
945 950 955 960 

CCA CCA CAG CCG CAC CTT GGC GTG AGC TCA GCA GCC AGC GGC CAC CTG 2928 
Pro Pro Gin Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu 
965 970 975 

GGC CGG AGC TTC CTG AGT GGA GAG CCG AGC CAG GCA GAC GTG CAG CCA 2976 
Gly Arg Ser Phe Leu Ser Gly Glu Pro Ser Gin Ala Asp Val Gin Pro 
980 985 990 

CTG GGC CCC AGC AGC CTG GCG GTG CAC ACT ATT CTG CCC CAG GAG AGC 3024 
Leu Gly Pro Ser Ser Leu Ala Val His Thr He Leu Pro Gin Glu Ser 
995 1000 1005 

CCC GCC CTG CCC ACG TCG CTG CCA TCC TCG CTG GTC CCA CCC GTG ACC 3072 
Pro Ala Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr 
1010 1015 1020 

GCA GCC CAG TTC CTG ACG CCC CCC TCG CAG CAC AGC TAC TCC TCG CCT 3120 
Ala Ala Gin Phe Leu Thr Pro Pro Ser Gin His Ser Tyr Ser Ser Pro 
1025 1030 1035 1040 

GTG GAC AAC ACC CCC AGC CAC CAG CTA CAG GTG CCT GTT CCT GTA ATG 3168 
Val Asp Asn Thr Pro Ser His Gin Leu Gin Val Pro Val Pro Val Met 
1045 1050 1055 

GTA ATG ATC CGA TCT TCG GAT CCT TCT AAA GGC TCA TCA ATT TTG ATC 3216 
Val Met He Arg Ser Ser Asp Pro Ser Lys Gly Ser Ser He Leu He 
1060 1065 1070 

GAA GCT CCC GAC TCA TGG 3234 
Glu Ala Pro Asp Ser Trp 
1075 



(2) INFORMATION FOR SEQ ID NO: 11 J 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1078 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Cys Gin Glu Asp Ala Gly Asn Lys Val Cys Ser Leu Gin Cys Asn Asn 
15 10 15 
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His Ala Cys Gly Trp Asp Gly Gly Asp Cys Ser Leu Asn Phe Asn Asp 
20 25 30 

Pro Trp Lys Asn Cys Thr Gin Ser Leu Gin Cys Trp Lys Tyr Phe Ser 
35 40 45 

Asp Gly His Cys Asp Ser Gin Cys Asn Ser Ala Gly Cys Leu Phe Asp 
50 55 60 

Gly Phe Asp Cys Gin Arg Ala Glu Gly Gin Cys Asn Pro Leu Tyr Asp 
65 70 75 80 

Gin Tyr Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gin Gly Cys 
85 90 95 

Asn Ser Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala Glu His Val 
100 105 110 

Pro Glu Arg Leu Ala Ala Gly Thr Leu Val Val Val Val Leu Met Pro 
115 120 125 

Pro Glu Gin Leu Arg Asn Ser Ser Phe His Phe Leu Arg Glu Leu Ser 
130 135 140 

Arg Val Leu His Thr Asn Val Val Phe Lys Arg Asp Ala His Gly Gin 
145 150 155 160 

Gin Met lie Phe Pro Tyr Tyr Gly Arg Glu Glu Glu Leu Arg Lys His 
165 170 175 

Pro lie Lys Arg Ala Ala Glu Gly Trp Ala Ala Pro Asp Ala Leu Leu 
180 185 190 

Gly Gin Val Lys Ala Ser Leu Leu Pro Gly Gly Ser Glu Gly Gly Arg 
195 200 205 

Arg Arg Arg Glu Leu Asp Pro Met Asp Val Arg Gly Ser lie Val Tyr 
210 . 215 220 

Leu Glu lie Asp Asn Arg Gin Cys Val Gin Ala Ser Ser Gin Cys Phe 
225 230 235 240 

Gin Ser Ala Thr Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser Leu 
245 250 255 

Gly Ser Leu Asn He Pro Tyr Lys He Glu Ala Val Gin Ser Glu Thr 
260 265 270 

Val Glu Pro Pro Pro Pro Ala Gin Leu His Phe Met Tyr Val Ala Ala 
275 280 285 

Ala Ala Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser 
290 295 300 

Arg Lys Arg Arg Arg Gin His Gly Gin Leu Trp Phe Pro Glu Gly Phe 
305 310 315 320 

Lys Val Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Pro Leu Gly Glu 
325 330 335 

Asp Ser Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp Gly Ala Leu 
340 345 350 

Met Asp Asp Asn Gin Asn Glu Trp Gly Asp Glu Asp Leu Glu Thr Lys 
355 360 365 

Lys Phe Arg Phe Glu Glu Pro Val Val Leu Pro Asp Leu Asp Asp Gin 
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* 370 375 380 

Thr Asp His Arg Gin Trp Thr Gin Gin His Leu Asp Ala Ala Asp Leu 
385 390 395 400 

Arg Met Ser Ala Met Ala Pro Thr Pro Pro Gin Gly Glu Val Asp Ala 
405 410 415 

Asp Cys Met Asp Val Asn Val Arg Gly Pro Asp Gly Phe Thr Pro Leu 
420 425 430 

Met He Ala Ser Cys Ser Gly Gly Gly Leu Glu Thr Gly Asn Ser Glu 
435 440 445 

Glu Glu Glu Asp Ala Pro Ala Val He Ser Asp Phe He Tyr Gin Gly 
450 455 460 

Ala Ser Leu His Asn Gin Thr Asp Arg Thr Gly Glu Thr Ala Leu His 
465 470 475 480 

Leu Ala Ala Arg Tyr Ser Arg Ser Asp Ala Ala Lys Arg Leu Leu Glu 
485 490 495 

Ala Ser Ala Asp Ala Asn He Gin Asp Asn Met Gly Arg Thr Pro Leu 
500 505 " 510 

His Ala Ala Val Ser Ala Asp Ala Gin Gly Val Phe Gin He Leu He 
515 520 525 

Arg Asn Arg Ala Thr Asp Leu Asp Ala Arg Met His Asp Gly Thr Thr 
530 535 540 

Pro Leu He Leu Ala Ala Arg Leu Ala Val Glu Gly Met Leu Glu Asp 
545 550 ~ 555 560 

Leu He Asn Ser His Ala Asp Val Asn Ala Val Asp Asp Leu Gly Lys 
565 570 575 

Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn Val Asp Ala Ala Val 
580 585 590 

Val Leu Leu Lys Asn Gly Ala Asn Lys Asp Met Gin Asn Asn Arg Glu 
595 600 605 

Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala 
610. 615 620 

Lys Val Leu Leu Asp His Phe Ala Asn Arg Asp He Thr Asp His Met 
625 630 635 640 

Asp Arg Leu Pro Arg Asp He Ala Gin Glu Arg Met His His Asp He 
645 650 655 

Val Arg Leu Leu Asp Glu Tyr Asn Leu Val Arg Ser Pro Gin Leu His 
660 665 670 

Gly Ala Pro Leu Gly Gly Thr Pro Thr Leu Ser Pro Pro Leu Cys Ser 
675 680 685 

Pro Asn Gly Tyr Leu Gly Ser Leu Lys Pro Gly Val Gin Gly Lys Lys 
690 ~ 695 700 

Val Arg Lys Pro Ser Ser Lys Gly Leu Ala Cys Gly Ser Lys Glu Ala 
705 710 715 " 720 

Lys Asp Leu Lys Ala Arg Arg Lys Lys Ser Gin Asp Gly Lys Gly Cys 
725 730 735 
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Leu Leu Asp Ser Ser Gly Met Leu Ser Pro Val Asp Ser Leu Glu Ser 
740 745 750 

Pro His Gly Tyr Leu Ser Asp Val Ala Ser Pro Pro Leu Leu Pro Ser 
755 760 765 

Pro Phe Gin Gin Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met 
770 775 780 

Pro Asp Thr His Leu Gly lie Gly His Leu Asn Val Ala Ala Lys Pro 
785 790 795 800 

Glu Met Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu Thr Gly 
805 810 815 

Pro Pro Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr Ser Thr Val 
820 825 ~ 830 

Leu Gly Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr Val Gly Gly Ser 
835 840 845 

Thr Ser Leu Asn Gly Gin Cys Glu Trp Leu Ser Arg Leu Gin Ser Gly 
850 855 860 

Met Val Pro Asn Gin Tyr Asn Pro Leu Arg Gly Ser Val Ala Pro Gly 
865 870 875 880 

Pro Leu Ser Thr Gin Ala Pro Ser Leu Gin His Gly Met Val Gly Pro 
885 890 895 

Leu His Ser Ser Leu Ala Ala Ser Ala Leu Ser Gin Met Met Ser Tyr 
900 905 910 

Gin Gly Leu Pro Ser Thr Arg Leu Ala Thr Gin Pro His Leu Val Gin 
915 920 925 

Thr Gin Gin Val Gin Pro Gin Asn Leu Gin Met Gin Gin Gin Asn Leu 
930 . 935 940 

Gin Pro Ala Asn lie Gin Gin Gin Gin Ser Leu Gin Pro Pro Pro Pro 
945 950 955 960 

Pro Pro Gin Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu 
965 970 975 

Gly Arg Ser Phe Leu Ser Gly Glu Pro Ser Gin Ala Asp Val Gin Pro 
980 985 990 

Leu Gly Pro Ser Ser Leu Ala Val His Thr lie Leu Pro Gin Glu Ser 
995 1000 1005 

Pro Ala Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr 
1010 1015 1020 

Ala Ala Gin Phe Leu Thr Pro Pro Ser Gin His Ser Tyr Ser Ser Pro • 
1025 1030 1035 1040 

Val Asp Asn Thr Pro Ser His Gin Leu Gin Val Pro Val Pro Val Met 
1045 1050 1055 

Val Met lie Arg Ser Ser Asp Pro Ser Lys Gly Ser Ser lie Leu lie 
1060 1065 1070 

Glu Ala Pro Asp Ser Trp 
1075 

(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 2.. 1972 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

G GAG GTG GAT GTG TTA GAT GTG AAT GTC CGT GGC CCA GAT GGC TGC 46 
Glu Val Asp Val Leu Asp Val Asn Val Arg Gly Pro Asp Gly Cys 
1 5 10 15 

ACC CCA TTG ATG TTG GCT TCT CTC CGA GGA GGC AGC TCA GAT TTG AGT 94 
Thr Pro Leu Met Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser 
20 25 30 

GAT GAA GAT GAA GAT GCA GAG GAC TCT TCT GCT AAC ATC ATC ACA GAC 142 
Asp Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala Asn lie lie Thr Asp 
35 40 45 

TTG GTC TAC CAG GGT GCC AGC CTC CAG GCC CAG ACA GAC CGG ACT GGT 190 
Leu Val Tyr Gin Gly Ala Ser Leu Gin Ala Gin Thr Asp Arg Thr Gly 
50 55 60 

GAG ATG GCC CTG CAC CTT GCA GCC CGC TAC TCA CGG GCT GAT GCT GCC 238 
Glu Met Ala Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala 
65 70 75 

AAG CGT CTC CTG GAT GCA GGT GCA GAT GCC AAT GCC CAG GAC AAC ATG 286 
Lys Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gin Asp Asn Met 
80 85 90 95 

GGC CGC TGT CCA CTC CAT GCT GCA GTG GCA GCT GAT GCC CAA GGT GTC 334 
Gly Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gin Gly Val 
100 105 110 

TTC CAG ATT CTG ATT CGC AAC CGA GTA ACT GAT CTA GAT GCC AGG ATG 382 
Phe Gin lie Leu lie Arg Asn Arg Val Thr Asp Leu Asp Ala Arg Met 
115 120 125 

AAT GAT GGT ACT ACA CCC CTG ATC CTG GCT GCC CGC CTG GCT GTG GAG 430 
Asn Asp Gly Thr Thr Pro Leu lie Leu Ala Ala Arg Leu Ala Val Glu 
130 135 140 

GGA ATG GTG GCA GAA CTG ATC AAC TGC CAA GCG GAT GTG AAT GCA GTG 478 
Gly Met Val Ala Glu Leu lie Asn Cys Gin Ala Asp Val Asn Ala Val 
145 150 155 

GAT GAC CAT GGA AAA TCT GCT CTT CAC TGG GCA GCT GCT GTC AAT AAT 526 
Asp Asp His Gly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn 
160 165 170 175 

GTG GAG GCA ACT CTT TTG TTG TTG AAA AAT GGG GCC AAC CGA GAC ATG 574 
Val Glu Ala Thr Leu Leu L u Leu Lys Asn Gly Ala Asn Arg Asp Met 
180 185 * 190 

CAG GAC AAC AAG GAA GAG ACA CCT CTG TTT CTT GCT GCC CGG GAG GGG 622 
Gin Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly 
195 200 205 
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AGC TAT GAA GCA GCC AAG ATC CTG TTA GAC CAT TTT GCC AAT CGA GAC 670 
Ser Tyr Glu Ala Ala Lys lie Leu Leu Asp His Phe Ala Asn Arg Asp 
210 215 220 

ATC ACA GAC CAT ATG GAT CGT CTT CCC CGG GAT GTG GCT CGG GAT CGC 718 
lie Thr Asp His Met Asp Arg Leu Pro Arg Asp Val Ala Arg Asp Arg 
225 230 235 

ATG CAC CAT GAC ATT GTG CGC CTT CTG GAT GAA TAC AAT GTG ACC CCA 766 
Met His His Asp He Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro 
240 245 250 255 

AGC CCT CCA GGC ACC GTG TTG ACT TCT GCT CTC TCA CCT GTC ATC TGT 814 
Ser Pro Pro Gly Thr Val Leu Thr Ser Ala Leu Ser Pro Val He Cys 
260 265 270 

GGG CCC AAC AGA TCT TTC CTC AGC CTG AAG CAC ACC CCA ATG GGC AAG 862 
Gly Pro Asn Arg Ser Phe Leu Ser Leu Lys His Thr Pro Met Gly Lys 
275 280 285 

AAG TCT AGA CGG CCC AGT GCC AAG AGT ACC ATG CCT ACT AGC CTC CCT 910 
Lys Ser Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro 
290 295 300 

AAC CTT GCC AAG GAG GCA AAG GAT GCC AAG GGT AGT AGG AGG AAG AAG 958 
Asn Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys 
305 310 315 

TCT CTG AGT GAG AAG GTC CAA CTG TCT GAG AGT TCA GTA ACT TTA TCC 1006 
Ser Leu Ser Glu Lys Val Gin Leu Ser Glu Ser Ser Val Thr Leu Ser 
320 325 330 335 

CCT GTT GAT TCC CTA GAA TCT CCT CAC ACG TAT GTT TCC GAC ACC ACA 1054 
Pro Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp Thr Thr 
340 345 350 

TCC TCT CCA ATG ATT ACA TCC CCT GGG ATC TTA CAG GCC TCA CCC AAC 1102 
Ser Ser Pro Met lie Thr Ser Pro Gly He Leu Gin Ala Ser Pro Asn 
355 360 365 

CCT ATG TTG GCC ACT GCC GCC CCT CCT GCC CCA GTC CAT GCC CAG CAT 1150 
Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val His Ala Gin His 
370 375 380 

GCA CTA TCT TTT TCT AAC CTT CAT GAA ATG CAG CCT TTG GCA CAT GGG 1198 
Ala Leu Ser Phe Ser Asn Leu His Glu Met Gin Pro Leu Ala His Gly 
385 390 395 

GCC AGC ACT GTG CTT CCC TCA GTG AGC CAG TTG CTA TCC CAC CAC CAC 1246 
Ala Ser Thr Val Leu Pro Ser Val Ser Gin Leu Leu Ser His His His 
400 405 410 415 

ATT GTG TCT CCA GGC AGT GGC AGT GCT GGA AGC TTG AGT AGG CTC CAT 1294 
lie Val Ser Pro Gly Ser Gly Ser Ala Gly Ser Leu Ser Arg Leu His 
420 425 430 

CCA GTC CCA GTC CCA GCA GAT TGG ATG AAC CGC ATG GAG GTG AAT GAG 1342 
Pro Val Pro Val Pro Ala Asp Trp Met Asn Arg Met Glu Val Asn Glu 
435 440 445 

ACC CAG TAC AAT GAG ATG TTT GGT ATG GTC CTG GCT CCA GCT GAG GGC 1390 
Thr Gin Tyr Asn Glu Met Phe Gly Met Val Leu Ala Pro Ala Glu Gly 
450 455 460 

ACC CAT CCT GGC ATA GCT CCC CAG AGC AGG CCA CCT GAA GGG AAG CAC 1438 
Thr His Pro Gly He Ala Pro Gin Ser Arg Pro Pro Glu Gly Lys His 
465 470 475 
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ATA ACC ACC CCT CGG GAG CCC TTG CCC CCC ATT GTG ACT TTC CAG CTC 1486 

lie Thr Thr Pro Arg Glu Pro Leu Pro Pro He Val Thr Phe Gin Leu 

480 485 490 495 

ATC CCT AAA GGC AGT ATT GCC CAA CCA GCG GGG GCT CCC CAG CCT CAG 1534 
lie Pro Lys Gly Ser lie Ala Gin Pro Ala Gly Ala Pro Gin Pro Gin 
500 505 510 

TCC ACC TGC CCT CCA GCT GTT GCG GGC CCC CTG CCC ACC ATG TAC CAG 1582 
Ser Thr Cys Pro Pro Ala Val Ala Gly Pro Leu Pro Thr Met Tyr Gin 
515 520 525 

ATT CCA GAA ATG GCC CGT TTG CCC AGT GTG GCT TTC CCC ACT GCC ATG 1630 
lie Pro Glu Met Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met 
530 535 540 

ATG CCC CAG CAG GAC GGG CAG GTA GCT CAG ACC ATT CTC CCA GCC TAT 1678 
Met Pro Gin Gin Asp Gly Gin Val Ala Gin Thr He Leu Pro Ala Tyr 
545 550 555 

CAT CCT TTC CCA GCC TCT GTG GGC AAG TAC CCC ACA CCC CCT TCA CAG 1726 
His Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr Pro Pro Ser Gin 
560 565 570 575 

CAC AGT TAT GCT TCC TCA AAT GCT GCT GAG CGA ACA CCC AGT CAC AGT 1774 
His Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser His Ser 
580 585 590 

GGT CAC CTC CAG GGT GAG CAT CCC TAC CTG ACA CCA TCC CCA GAG TCT 1822 
Gly His Leu Gin Gly Glu His Pro Tyr Leu Thr Pro Ser Pro Glu Ser 
595 600 605 

CCT GAC CAG TGG TCA AGT TCA TCA CCC CAC TCT GCT TCT GAC TGG TCA 1870 
Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser Ala Ser Asp Trp Ser 
610 615 620 

GAT GTG ACC ACC AGC CCT ACC CCT GGG GGT GCT GGA GGA GGT CAG CGG 1918 
Asp Val Thr Thr Ser Pro ^hr Pro Gly Gly Ala Gly Gly Gly Gin Arg 
625 630 635 

GGA CCT GGG ACA CAC ATG TCT GAG CCA CCA CAC AAC AAC ATG CAG GTT 1966 
Gly Pro Gly Thr His Met Ser Glu Pro Pro His Asn Asn Met Gin Val 
640 645 650 655 

TAT GCG TGAGAGAGTC CACCTCCAGT GTAGAGACAT AACTGACTTT TGTAAATGCT 2022 
Tyr Ala 



GCTGAGGAAC AAATGAAGGT CATCCGGGAG AGAAATGAAG AAATCTCTGG AG CC AG CTTC 2082 

TAGA6GTAGG AAAGAGAAGA TGTTCTTATT CAGATAATGC AAGAGAAGCA ATTCGTCAGT 2142 

TTCACTGGGT ATCTGCAAGG CTTATTGATT ATTCTAATCT AATAAGACAA GTTTGTGGAA 2202 

ATG C AAG ATG AATACAAGCC TTGGGTCCAT GTTTACTCTC TTCTATTTGG AGAATAAGAT 2262 

GGATGCTTAT TGAAGCCCAG ACATTCTTGC AGCTTGGACT GCATTTTAAG CCCTGCAGGC 2322 

TTCTGCCATA TCCATGAGAA GATTCTACAC TAGCGTCCTG TTGGGAATTA TGCCCTGGAA 2382 

TTCTGCCTGA ATTGACCTAC GCATCTCCTC CTCCTTGGAC ATTCTTTTGT CTTCATTTGG 2442 

TGCTTTTGGT TTTGCACCTC TCCGTGATTG TAGCCCTACC AGCATGTTAT AGGGCAAGAC 2502 

CTTTGTGCTT TTGATCATTC TGGCCCATGA AAGCAACTTT GGTCTCCTTT CCCCTCCTGT 2562 



CTTCCCGGTA TCCCTTGGAG TCTCACAAGG TTTACTTTGG TATGGTTCTC AGCACAAACC 2622 
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TTTCAAGTAT 


GTTGTTTCTT 


TGGAAAATGG 


ACATACTGTA 


TTGTGTTCTC 


CTGCATATAT 


2682 


CATTCCTGGA 


GAGAGAAGGG 


GAGAAGAATA 


CTTTTCTTCA 


ACAAATTTTG 


GGGGCAGGAG 


2742 


ATCCCTTCAA 


GAGGCTGCAC 


CTTAATTTTT 


CTTGTCTGTG 


TGCAGGTCTT 


CATATAAACT 


2802 


TTACCAGGAA 


GAAGGGTGTG 


AGTTTGTTGT 


TTTTCTGTGT 


ATGGGCCTGG 


TCAGTGTAAA 


2862 


GTTTTATCCT 


TGATAGTCTA 


GTTACTATGA 


CCCTCCCCAC 


TTTTTTAAAA 


CCAGAAAAAG 


2922 


GTTTGGAATG 


TTGGAATGAC 


CAAGAGACAA 


GTTAACTCGT 


GCAAGAGCCA 


GTTACCCACC 


2982 


CACAGGTCCC 


CCTACTTCCT 


GCCAAGCATT 


CCATTGACTG 


CCTGTATGGA 


ACACATTTGT 


3042 


CCCAGATCTG 


AGCATTCTAG 


GCCTGTTTCA 


CTCACTCACC 


C AG CAT ATG A 


AACTAGTCTT 


3102 


AACTGTTGAG 


CCTTTCCTTT 


CATATCCACA 


GAAGACACTG 


TCTCAAATGT 


TGTACCCTTG 


3162 


CCATTTAGGA 


CTGAACTTTC 


CTTAGCCCAA 


GGGACCCAGT 


GACAGTTGTC 


TTCCGTTTGT 


3222 


CAGATGATCA 


GTCTCTACTG 


ATTATCTTGC 


TGCTTAAAGG 


CCTGCTCACC 


AATCTTTCTT 


3282 


TCACACCGTG 


TGGTCCGTGT 


TACTGGTATA 


CCCAGTATGT 


TCTCACTGAA 


GACATGGACT 


3342 


TTATATGTTC AAGTGCAGGA ATTGGAAAGT 


TGGACTTGTT 


TTCTATGATC 


CAAAACAGCC 


3402 


CTATAAGAAG 


GTTGGAAAAG 


GAGGAACTAT 


ATAGCAGCCT 


TTGCTATTTT 


CTGCTACCAT 


3462 


TTCTTTTCCT 


CTGAAGCGGC 


CATGACATTC 


CCTTTGGCAA 


CT AACG TAG A 


AACTCAACAG 


3522 


AACATTTTCC 


TTTCCTAGAG 


TCACCTTTTA 


GATGATAATG 


G ACAACT AT A 


GACTTGCTCA 


3582 


TTGTTCAGAC 


TGATTGCCCC 


TCACCTGAAT 


CCACTCTCTG 


TATTCATGCT 


CTTGGCAATT 


3642 


TCTTTGACTT 


TCTTTTAAGG 


GCAGAAGCAT 


TTTAGTTAAT 


TG TAG AT AAA 


GAATAGTTTT 


3702 


CTTCCTCTTC 


TCCTTGGGCC 


AGTTAATAAT 


TGGTCCATGG 


CTACACTGCA 


ACTTCCGTCC 


3762 


AGTGCTGTGA 


TGCCCATGAC 


ACCTGCAAAA 


TAAGTTCTGC 


CTGGGCATTT 


TGTAGATATT 


3822 


AACAGGTGAA 


TTCCCGACTC 


TTTTGGTTTG 


AATGACAGTT 


CTCATTCCTT 


CTATGGCTGC 


3882 


AAGTATGCAT 


CAGTGCTTCC 


CACTTACCTG 


ATTTGTCTGT 


CGGTGGCCCC 


ATATGGAAAC 


3942 


CCTGCGTGTC 


TGTTGGCATA 


ATAGTTTACA 


AATGGTTTTT 


TC AG TCCT AT 


CCAAATTTAT 


4002 


TGAACCAACA 


AAAATAATTA 


CTTCTGCCCT 


GAGATAAGCA 


GATTAAGTTT 


GTTCATTCTC 


4062 


TGCTTTATTC 


TCTCCATGTG 


GCAACATTCT 


GTCAGCCTCT 


TTCATAG TGT 


GCAAACATTT 


4122 


TATCATTCTA 


AATGGTGACT 


CTCTGCCCTT 


GGACCCATTT 


ATTATTCACA 


GATGGGGAGA 


4182 


ACCTATCTGC 


ATGGACCCTC 


ACCATCCTCT 


GTGCAGCACA 


CACAGTG CAG 


GGAGCCAGTG 


4242 


GCGATGGCGA 


TGACTTTCTT 


CCCCTG 








4268 



<2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



ID: <WO 9407474A1 I > 
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Glu Val Asp Val Leu Asp Val Asn Val Arg Gly Pro Asp Gly Cys Thr 
15 10 15. 

Pro Leu Met Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser Asp 
20 25 30 

Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala Asn lie He Thr Asp Leu 
35 40 45 

Val Tyr Gin Gly Ala Ser Leu Gin Ala Gin Thr Asp Arg Thr Gly Glu 
50 55 60 

Met Ala Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala Lys 
65 70 75 80 

Arg Leu Leu Asp Ala Gly Ala Asp Ala- Asn Ala Gin Asp Asn Met Gly 
85 90 95 

Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gin Gly Val Phe 
100 105 110 

Gin lie Leu He Arg Asn Arg Val Thr Asp Leu Asp Ala Arg Met Asn 
115 120 125 

Asp Gly Thr Thr Pro Leu He Leu Ala Ala Arg Leu Ala Val Glu Gly 
130 135 140 

Met Val Ala Glu Leu lie Asn Cys Gin Ala Asp Val Asn Ala Val Asp 
145 150 155 160 

Asp His Gly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn Val 
165 170 175 

Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly Ala Asn Arg Asp Met Gin 
180 185 190 

Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser 
195 200 205 

Tyr Glu Ala Ala Lys He Leu Leu Asp His Phe Ala Asn Arg Asp He 
210 215 220 

Thr Asp His Met Asp Arg Leu Pro Arg Asp Val Ala Arg Asp Arg Met 
225 ' 230 235 240 

His His Asp He Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro Ser 
245 250 255 

Pro Pro Gly Thr Val Leu Thr Ser Ala Leu Ser Pro Val He Cys Gly 
260 265 270 

Pro Asn Arg Ser Phe Leu Ser Leu Lys His Thr Pro Met Gly Lys Lys 
275 280 285 

Ser Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro Asn 
290 295 300 

Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys Ser 
305 310 315 320 

Leu Ser Glu Lys Val Gin Leu Ser Glu Ser Ser Val Thr Leu Ser Pro 
325 330 335 

Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp Thr Thr S r 
340 345 350 

Ser Pro Met He Thr Ser Pro Gly He Leu Gin Ala Ser Pro Asn Pro 
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355 360 365 

Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val His Ala Gin His Ala 
370 375 380 

Leu Ser Phe Ser Asn Leu His Glu Met Gin Pro Leu Ala His Gly Ala 
385 390 395 400 

Ser Thr Val Leu Pro Ser Val Ser Gin Leu Leu Ser His His His lie 
405 410 415 

Val Ser Pro Gly Ser Gly Ser Ala Gly Ser Leu Ser Arg Leu His Pro 
420 425 430 

Val Pro Val Pro Ala Asp Trp Met Asn Arg Met Glu Val Asn Glu Thr 
435 440 445 

Gin Tyr Asn Glu Met Phe Gly Met Val Leu Ala Pro Ala Glu Gly Thr 
450 455 460 

His Pro Gly lie Ala Pro Gin Ser Arg Pro Pro Glu Gly Lys His lie 
465 470 475 480 

Thr Thr Pro Arg Glu Pro Leu Pro Pro lie Val Thr Phe Gin Leu lie 
485 490 495 

Pro Lys Gly Ser lie Ala Gin Pro Ala Gly Ala Pro Gin Pro Gin Ser 
500 505 510 

Thr Cys Pro Pro Ala Val Ala Gly Pro Leu Pro Thr Met Tyr Gin lie 
515 520 525 

Pro Glu Met Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met Met 
530 535 540 

Pro Gin Gin Asp Gly Gin Val Ala Gin Thr lie Leu Pro Ala Tyr His 
545 550 555- 560 

Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr Pro Pro Ser Gin His 
565 570 575 

Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser His Ser Gly 
580 585 590 

His Leu Gin Gly Glu His Pro Tyr Leu Thr Pro Ser Pro Glu Ser Pro 
595 600 605 

Asp Gin Trp Ser Ser Ser Ser Pro His Ser Ala Ser Asp Trp Ser Asp 
610 615 620 

Val Thr Thr Ser Pro Thr Pro Gly Gly Ala Gly Gly Gly Gin Arg Gly 
625 630 635 640 

Pro Gly Thr His Met Ser Glu Pro Pro His Asn Asn Met Gin Val Tyr 
645 650 655 

Ala 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 
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<ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Glu Asp lie Asp Glu Cys Asp Gin Gly Ser Pro Cys Glu His Asn Gly 
1 5 * 10 15 

lie Cys Val Asn Thr Pro Gly Ser Tyr Arg Cys Asn Cys Ser Gin Gly 
20 25 30 

Phe Thr Gly Pro Arg Cys Glu Thr Asn lie Asn Glu Cys Glu Ser His 
35 40 45 

Pro Cys Gin Asn Glu Gly Ser Cys Leu Asp Asp Pro Gly Thr Phe Arg 
50 ~ 55 60 

Cys Val Cys Met Pro Gly Phe Thr Gly Thr Gin Cys Glu 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Asn Asp Val Asp Glu Cys Ser Leu Gly Ala Asn Pro Cys Glu His Gly 
1 5 10 15 

Gly Arg Cys Thr Asn Thr Leu Gly Ser Phe Gin Cys Asn Cys Pro Gin 
20 25 30 

Gly Tyr Ala Gly Pro Arg Cys Glu lie Asp Val Asn Glu Cys Leu Ser 
35 40 45 

Asn Pro Cys Gin Asn Asp Ser Thr Cys Leu Asp Gin lie Gly Glu Phe 
50 55 60 

Gin Cys lie Cys Met Pro Gly Tyr Glu Gly Leu Tyr Cys Glu 
65 * 70 75 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
Thr Pro Pro Gin Gly Glu lie Glu Ala Asp Cys Met Asp Val Asn Val 
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15 10 15 

Arg Gly Pro Asp Gly Phe Thr Pro Leu Met He Ala Ser Cys Ser Gly 
20 25 30 

Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu Glu Asp Ala Ser Ala 
35 40 45 

Asn Met lie Ser Asp Phe He Gly Gin Gly Ala Gin Leu His Asn Gin 
50 55 60 

Thr Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arg Tyr Ala 
65 70 75 80 

Arg Ala Asp Ala Ala Lys Arg Leu Leu Glu Ser Ser Ala Asp Ala Asn 
85 90 95 

Val Gin Asp Asn Met Gly Arg Thr Pro Leu His Ala Ala Val Ala Ala 
100 105 110 

Asp Ala Gin Gly Val Phe Gin He Leu He Arg Asn Arg Ala Thr Asp 
115 * 120 125 

Leu Asp Ala Arg Met Phe Asp Gly Thr Thr Pro Leu lie Leu Ala Ala 
130 135 140 

Arg Leu Ala Val Glu Gly Met Val Glu Glu Leu He Asn Ala His Ala 
145 150 155 160 

Asp Val Asn Ala Val Asp Glu Phe Gly Lys Ser Ala Leu His Trp Ala 
165 170 175 

Ala Ala Val Asn Asn Val Asp Ala Ala Ala Val Leu Leu Lys Asn Ser 
180 185 190 

Ala Asn Lys Asp Met Gin Asn Asn Lys Glu Glu Thr Ser Leu Phe Leu 
195 200 205 

Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val Leu Leu Asp His 
210 215 220 

Tyr Ala Asn Arg Asp He Thr Asp His Met Asp Arg Leu Pro Arg Asp 
225 230 235 240 

lie Ala Gin Glu Arg Met His His Asp He Val His Leu Leu Asp Glu 
245 250 255 

Tyr Asn Leu Val Lys Ser Pro Thr Leu His Asn Gly Pro Leu Gly Ala 
260 265 270 

Thr Thr Leu Ser Pro Pro He Cys Ser Pro Asn Gly Tyr Met Gly Asn 
275 280 285 

Met Lys Pro Ser Val Gin Ser Lys Lys Ala Arg Lys Pro Ser He Lys 
290 295 300 

Gly Asn Gly Cys Lys Glu Ala Lys Glu Leu Lys Ala Arg Arg Lys Lys 
305 * ~ 310 315 320 

Ser Gin Asp Gly Lys Thr Thr Leu Leu Asp Ser Gly Ser Ser Gly Val 
325 330 335 

Leu Ser Pro Val Asp Ser Leu Glu Ser Thr His Gly Tyr Leu Ser Asp 
340 345 . 350 

Val Ser Ser Pro Pro Leu Met Thr Ser Pro Phe Gin Gin S r Pro Ser 
355 360 365 
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Met Pro Leu Asn His Leu Thr Ser Met Pro Glu Ser Gin Leu Gly Met 
370 375 380 

Asn His He Asn Met Ala Thr Lys Gin Glu Met Ala Ala Gly Ser Asn 
385 390 395 400 

Arg Met Ala Phe Asp Ala Met Val Pro Arg Leu Thr His Leu Asn Ala 
405 410 415 

Ser Ser Pro Asn Thr He Met Ser Asn Gly Ser Met His Phe Thr Val 
420 425 430 

Gly Gly Ala Pro Thr Met Asn Ser Gin Cys Asp Trp Leu Ala Arg Leu 
435 440 445 

Gin Asn Gly Met Val Gin Asn Gin Tyr Asp Pro lie Arg Asn Gly He 
450 455 ~ 460 

Gin Gin Gly Asn Ala Gin Gin Ala Gin Ala Leu Gin His Gly Leu Met 
465 470 475 480 

Thr Ser Leu His Asn Gly Leu Pro Ala Thr Thr Leu Ser Gin Met Met 
485 490 495 

Thr Tyr Gin Ala Met Pro Asn Thr Arg Leu Ala Asn Gin Pro His Leu 
500 505 510 

Met Gin Ala Gin Gin Met Gin Gin Gin Gin Asn Leu Gin Leu His Gin 
515 520 525 

Ser Met Gin Gin Gin His His Asn Ser Ser Thr Thr Ser Thr His He 
530 535 540 

Asn Ser Pro Phe Cys Ser Ser Asp He Ser Gin Thr Asp Leu Gin Gin 
545 550 555 560 

Met Ser Ser Asn Asn He His Ser Val Met Pro Gin Asp Thr Gin He 
565 570 575 

Phe Ala Ala Ser Leu Pro Ser Asn Leu Thr Gin Ser Met Thr Thr Ala 
580 565 590 

Gin Phe Leu Thr Pro Pro Ser Gin His Ser Tyr Ser Ser Pro Met Asp 
595 600 605 

Asn Thr Pro Ser His Gin Leu Gin Val Pro Asp His Pro Phe Leu Thr 
610 615 620 

Pro Ser Pro Glu Ser Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser 
625 630 * 635 640 

Asn Met Ser Asp Trp Ser Glu Gly He Ser Ser Pro Pro Thr 
645 650 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 666 amino acids 

(B) TYPE: amino acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
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Thr Pro Pro Gin Gly Glu Val Asp Ala Asp Cys Met Asp Val Asn Val 
1 5 10 15 

Arg Gly Pro Asp Gly Phe Thr Pro Leu Met lie Ala Ser Cys Ser Gly 
20 25 30 

Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu Glu Asp Ala Pro Ala 
35 40 45 

Val lie Ser Asp Phe lie Tyr Gin Gly Ala Ser Leu His Asn Gin Thr 
50 ~ 55 60 

Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arg Tyr Ser Arg 
65 70 75 80 

Ser Asp Ala Ala Lys Arg Leu Leu Glu Ala Ser Ala Asp Ala Asn lie 
85 90 95 

Gin Asp Asn Met Gly Arg Thr Pro Leu His Ala Ala Val Ser Ala Asp 
100 105 110 

Ala Gin Gly Val Phe Gin lie Leu Leu Arg Asn Arg Ala Thr Asp Leu 
115 120 125 

Asp Ala Arg Met His Asp Gly Thr Thr Pro Leu He Leu Ala Ala Arg 
130 " 135 140 

Leu Ala Val Glu Gly Met Leu Glu Asp Leu He Asn Ser His Ala Asp 
145 150 155 160 

Val Asn Ala Val Asp Asp Leu Gly Lys Ser Ala Leu His Trp Ala Ala 
165 170 175 

Ala Val Asn Asn Val Asp Ala Ala Val Val Leu Leu Lys Asn Gly Ala 
180 185 190 

Asn Lys Asp Met Gin Asn Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala 
195 200 205 

Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val Leu Leu Asp His Phe 
210 215 220 

Ala Asn Arg Asp lie Thr Asp His Met Asp Arg Leu Pro Arg Asp He 
225 230 235 240 

Ala Gin Glu Arg Met His His Asp He Val Arg Leu Leu Asp Glu Tyr 
245 250 255 

Asn Leu Val Arg Ser Pro Gin Leu His Gly Thr Ala Leu Gly Gly Thr 
260 265 270 

Pro Thr Leu Ser Pro Thr Leu Cys Ser Pro Asn Gly Tyr Leu Gly Asn 
275 280 285 

Leu Lys Ser Ala Thr Gin Gly Lys Lys Ala Arg Lys Pro Ser Thr Lys 
290 295 300 

Gly Leu Ala Cys Ser Ser Lys Glu Ala Lys Asp Leu Lys Ala Arg Arg 
305 310 315 320 

Lys Lys Ser Gin Asp Gly Lys Gly Cys Leu Leu Asp Ser Ser Ser Met 
325 330 335 

Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His Gly Tyr Leu Ser Asp 
340 345 350 

Val Ala Ser Pro Pro Leu Pro S r Pro Phe Gin Gin Ser Pro Ser Met 
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355 360 365 

Pro Leu Ser His Leu Pro Gly Met Pro Asp Thr His Leu Gly He Ser 
370 375 380 

His Leu Asn Val Ala Ala Lys Pro Glu Met Ala Ala Leu Ala Gly Gly 
385 390 395 400 

Ser Arg Leu Ala Phe Glu Pro Pro Pro Pro Arg Leu ser His Leu Pro 
405 410 415 

Val Ala Ser Ser Ala Ser Thr Val Leu Ser Thr Asn Gly Thr Gly Ala 
420 425 430 

Met Asn Phe Thr Val Gly Ala Pro Ala Ser Leu Asn Gly Gin Cys Glu 
435 440 445 

Trp Leu Pro Arg Leu Gin Asn Gly Met Val Pro Ser Gin Tyr Asn Pro 
450 455 460 

Leu Arg Pro Gly Val Thr Pro Gly Thr Leu Ser Thr Gin Ala Ala Gly 
465 470 475 480 

Leu Gin His Gly Met Met Ser Pro He His Ser Ser Leu Ser Thr Asn 
485 490 495 

Thr Leu Ser Pro lie lie Tyr Gin Gly Leu Pro Asn Thr Arg Leu Ala 
500 505 510 

Thr Gin Pro His Leu Val Gin Thr Gin Gin Val Gin Pro Gin Asn Leu 
515 520 525 

Gin lie Gin Pro Gin Asn Leu Gin Pro Pro Ser Gin Pro His Leu Ser 
530 535 540 

Val Ser Ser Ala Ala Asn Gly His Leu Gly Arg Ser Phe Leu Ser Gly 
545 550 555 560 

Glu Pro Ser Gin Ala Asp Val Gin Pro Leu Gly Pro Ser Ser Leu Pro 
565 570 575 

Val His Thr lie Leu Pro Gin Glu Ser Gin Ala Leu Pro Thr Ser Leu 
580 585 590 

Pro Ser Ser Met Val Pro Pro Met Thr Thr Thr Gin Phe Leu Thr Pro 
595 600 605 

Pro Ser Gin His Ser Tyr Ser Ser Ser Pro Val Asp Asn Thr Pro Ser 
610 615 620 

His Gin Leu Gin Val Pro Glu His Pro Phe Leu Thr Pro Ser Pro Glu 
625 630 635 640 

Ser Pro Asp Gin Trp Ser Ser Ser Ser Arg His Ser Asn He Ser Asp 
645 650 655 

Trp Ser Glu Gly He Ser Ser Pro Pro Thr 
660 665 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Thr Pro Pro Gin Gly Glu Val Asp Ala Asp Cys Met Asp Val Asn Val 
15 10 15 

Arg Gly Pro Asp Gly Phe Thr Pro Leu Met lie Ala Ser Cys Ser Gly 
20 25 30 

Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu Glu Asp Ala Pro Ala 
35 40 45 

Val lie Ser Asp Phe lie Tyr Gin Gly Ala Ser Leu His Asn Gin Thr 
50 55 60 

Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arg Tyr Ser Arg 
65 70 75 80 

Ser Asp Ala Ala Lys Arg. Leu Leu Glu Ala Ser Ala Asp Ala Asn lie 
85 90 95 

Gin Asp Asn Met Gly Arg Thr Pro Leu His Ala Ala Val Ser Ala Asp 
100 105 110 

Ala Gin Gly Val Phe Gin lie Leu lie Arg Asn Arg Ala Thr Asp Leu 
115 120 125 

Asp Ala Arg Met His Asp Gly Thr Thr Pro Leu lie Leu Ala Ala Arg 
130 135 140 

Leu Ala Val Glu Gly Met Leu Glu Asp Leu lie Asn Ser His Ala Asp 
145 150 155 160 

Val Asn Ala Val Asp Asp Leu Gly Lys Ser Ala Leu His Trp Ala Ala 
165 170 175 

Ala Val Asn Asn Val Asp Ala Ala Val Val Leu Leu Lys Asn Gly Ala 
180 185 190 

Asn Lys Asp Met Gin Asn Asn Arg Glu Glu Thr Pro Leu Phe Leu Ala 
195 200 205 

Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val Leu Leu Asp His Phe 
210 215 220 

Ala Asn Arg Asp lie Thr Asp His Met Asp Arg Leu Pro Arg Asp lie 
225 230 235 240 

Ala Gin Glu Arg Met His His Asp lie Val Arg Leu Leu Asp Glu Tyr 
245 250 255 

Asn Leu Val Arg Ser Pro Gin Leu His Gly Ala Pro Leu Gly Gly Thr 
260 265 270 

Pro Thr Leu Ser Pro Pro Leu Cys Ser Pro Asn Gly Tyr Leu Gly Ser 
275 280 285 

Leu Lys Pro Gly Val Gin Gly Lys Lys Val Arg Lys Pro Ser Ser Lys 
290 295 300 

Gly Leu Ala Cys Gly Ser Lys Glu Ala Lys Asp Leu Lys Ala Arg Arg 
305 310 315 320 

Lys Lys Ser Gin Asp Gly Lys Gly Cys Leu Leu Asp Ser Ser Gly Met 
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325 330 335 

Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His Gly Tyr Leu Ser Asp 
340 345 350 

Val Ala Ser Pro Pro Leu Leu Pro Ser Pro Phe Gin Gin Ser Pro Ser 
355 360 365 

Val Pro Leu Asn His Leu Pro Gly Met Pro Asp Thr His Leu Gly He 
370 375 380 

Gly His Leu Asn Val Ala Ala Lys Pro Glu Met Ala Ala Leu Gly Gly 
385 390 395 400 

Gly Gly Arg Leu Ala Phe Glu Thr Gly Pro Pro Arg Leu Ser His Leu 
405 410 415 

Pro Val Ala Ser Gly Thr Ser Thr Val Leu Gly Ser Ser Ser Gly Gly 
420 425 430 

Ala Leu Asn Phe Thr Val Gly Gly Ser Thr Ser Leu Asn Gly Gin Cys 
435 440 445 

Glu Trp Leu Ser Arg Leu Gin Ser Gly Met Val Pro Asn Gin Tyr Asn 
450 455 460 

Pro Leu Arg Gly Ser Val Ala Pro Gly Pro Leu Ser Thr Gin Ala Pro 
465 470 475 480 

Ser Leu Gin His Gly Met Val Gly Pro Leu His Ser Ser Leu Ala Ala 
485 490 495 

Ser Ala Leu Ser Gin Met Met Ser Tyr Gin Gly Leu Pro Ser Thr Arg 
500 505 510 

Leu Ala Thr Gin Pro His Leu Val Gin Thr Gin Gin Val Gin Pro Gin 
515 520 525 

-v 

Asn Leu Gin Met Gin Gin Gin Asn Leu Gin Pro Ala Asn He Gin Gin 
530 535 540 

Gin Gin Ser Leu Gin Pro Pro Pro Pro Pro Pro Gin Pro His Leu Gly 
545 550 555 560 

Val Ser Ser Ala Ala Ser Gly His Leu Gly Arg Ser Phe Leu Ser Gly 
565 570 575 

Glu Pro Ser Gin Ala Asp Val Gin Pro Leu Gly Pro Ser Ser Leu Ala 
580 585 590 

Val His Thr He Leu Pro Gin Glu Ser Pro Ala Leu Pro Thr Ser Leu 
5>?5 600 605 

Pro Ser Ser Leu Val Pro Pro Val Thr Ala Ala Gin Phe Leu Thr Pro 
610 615 620 

Pro Ser Gin His Ser Tyr Ser Ser Pro Val Glu Asn Thr Pro Ser His 
625 630 635 640 

Gin Leu Gin Val Pro Glu His Pro Phe Leu Thr Pro Ser Pro Glu Ser 
645 650 655 

Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser Asn Val Ser Asp Trp 
660 665 670 

Ser Glu Gly Val Ser Ser Pro Pro Thr 
675 680 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2471 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Pro Ala Leu Arg Pro Ala Leu Leu Trp Ala Leu Leu Ala Leu Trp 
15 10 15 

Leu Cys Cys Ala Ala Pro Ala His Ala Leu Gin Cys Arg Asp Gly Tyr 
20 25 30 

Glu Pro Cys Val Asn Glu Gly Met Cys Val Thr Tyr His Asn Gly Thr 
35 40 45 

Gly Tyr Cys Lys Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys Gin His 
50 55 60 

Arg Asp Pro Cys Glu Lys Asn Arg Cys Gin Asn Gly Gly Thr Cys Val 
65 70 75 80 

Ala Gin Ala Met Leu Gly Lys Ala Thr Cys Arg Cys Ala Ser Gly Phe 
85 90 95 

Thr Gly Glu Asp Cys Gin Tyr Ser Thr Ser His Pro Cys Phe Val Ser 
100 105 110 

Arg Pro Cys Leu Asn Gly Gly Thr Cys His Met Leu Ser Arg Asp Thr 
115 120 125 

\ 

Tyr Glu Cys Thr Cys Gin Val Gly Phe Thr Gly Lys Glu Cys Gin Trp 
130 135 140 

Thr Asp Ala Cys Leu Ser His Pro Cys Ala Asn Gly Ser Thr Cys Thr 
145 150 155 160 

Thr Val Ala Asn Gin Phe Ser Cys Lys Cys Leu Thr Gly Phe Thr Gly 
165 170 175 

Gin Lys Cys Glu Thr Asp Val Asn Glu Cys Asp lie Pro Gly His Cys 
180 185 190 

Gin His Gly Gly Thr Cys Leu Asn Leu Pro Gly Ser Tyr Gin Cys Gin 
195 200 205 

Cys Pro Gin Gly Phe Thr Gly Gin Tyr Cys Asp Ser Leu Tyr Val Pro 
210 215 220 

Cys Ala Pro Ser Pro Cys Val Asn Gly Gly Thr Cys Arg Gin Thr Gly 
225 230 235 240 

Asp Phe Thr Phe Glu Cys Asn Cys Leu Pro Gly Phe Glu Gly Ser Thr 
245 250 255 

Cys Glu Arg Asn lie Asp Asp Cys Pro Asn His Arg Cys Gin Asn Gly 
260 265 270 

Gly Val Cys Val Asp Gly Val Asn Thr Tyr Asn Cys Arg Cys Pro Pro 
275 280 285 
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Gln Trp Thr Gly Gin Phe Cys Thr Glu Asp Val Asp Glu Cys Leu Leu 
290 295 300 

Gin Pro Asn Ala Cys Gin Asn Gly Gly Thr Cys Ala Asn Arg Asn Gly 
305 310 315 320 

Glv Tvr Gly Cys Val Cys Val Asn Gly Trp Ser Gly Asp Asp Cys Ser 
325 330 335 

Glu Asn lie Asp Asp Cys Ala Phe Ala Ser Cys Thr Pro Gly Ser Thr 
340 345 350 

Cys He Asp Arg Val Ala Ser Phe Ser Cys Met Cys Pro Glu Gly Lys 
355 360 365 

Ala Gly Leu Leu Cys His Leu Asp Asp Ala Cys He Ser Asn Pro Cys 
370 375 380 

His Lys Gly Ala Leu Cys Asp Thr Asn Pro Leu Asn Gly Gin Tyr He 
385 390 395 400 

Cys Thr Cys Pro Gin Gly Tyr Lys Gly Ala Asp Cys Thr Glu Asp Val 
405 410 415 

Asp Glu Cys Ala Met Ala Asn Ser Asn Pro Cys Glu His Ala Gly Lys 
420 425 430 

Cys Val Asn Thr Asp Gly Ala Phe His Cys Glu Cys Leu Lys Gly Tyr 
435 ~ 440 445 

Ala Gly Pro Arg Cys Glu Met Asp lie Asn Glu Cys His Ser Asp Pro 
450 ~ 455 460 

Cys Gin Asn Asp Ala Thr Cys Leu Asp Lys He Gly Gly Phe Thr Cys 
465 470 475 480 

Leu Cys Met Pro Gly Phe Lys Gly Val His Cys Glu Leu Glu He Asn 
485 490 495 

Glu Cys Gin Ser Asn Pro Cys Val Asn Asn Gly Gin Cys Val Asp Lys 
500 505 510 

Val Asn Arg Phe Gin Cys Leu Cys Pro Pro Gly Phe Thr Gly Pro Val 
515 520 525 

Cys Gin He Asp He Asp Asp Cys Ser Ser Thr Pro Cys Leu Asn Gly 
530 " 535 540 

Ala Lys Cys He Asp His Pro Asn Gly Tyr Glu Cys Gin Cys Ala Thr 
545 * * 550 555 560 

Gly Phe Thr Gly Val Leu Cys Glu Glu Asn He Asp Asn Cys Asp Pro 
565 ^ 570 575 

Asp Pro Cys His His Gly Gin Cys Gin Asp Gly lie Asp Ser Tyr Thr 
580 585 590 

Cys He Cys Asn Pro Gly Tyr Met Gly Ala He Cys Ser Asp Gin He 
595 600 605 

Asp Glu Cys Tyr Ser Ser Pro Cys Leu Asn Asp Gly Arg Cys II Asp 
610 615 620 

Leu Val Asn Gly Tyr Gin Cys Asn Cys Gin Pro Gly Thr Ser Gly Val 
625 630 635 640 

Asn Cys Glu He Asn Phe Asp Asp Cys Ala Ser Asn Pro Cys lie His 
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645 650 655 

Gly lie Cys Met Asp Gly lie Asn Arg Tyr Ser Cys Val Cys Ser Pro 
660 665 670 

Gly Phe Thr Gly Gin Arg Cys Asn lie Asp lie Asp Glu Cys Ala Ser 
675 680 685 

Asn Pro Cys Arg Lys Gly Ala Thr Cys lie Asn Gly Val Asn Gly Phe 
690 695 700 

Arg Cys lie Cys Pro Glu Gly Pro His His Pro Ser Cys Tyr Ser Gin 
705 710 715 720 

Val Asn Glu Cys Leu Ser Asn Pro Cys lie His Gly Asn Cys Thr Gly 
725 730 * 735 

Gly Leu Ser Gly Tyr Lys Cys Leu Cys Asp Ala Gly Trp Val Gly He 
740 745 750 

Asn Cys Glu Val Asp Lys Asn Glu Cys Leu Ser Asn Pro Cys Gin Asn 
755 760 765 

Gly Gly Thr Cys Asp Asn Leu Val Asn Gly Tyr Arg Cys Thr Cys Lys 
770 775 780 

Lys Gly Phe Lys Gly Tyr Asn Cys Gin Val Asn He Asp Glu Cys Ala 
785 790 795 800 

Ser Asn Pro Cys Leu Asn Gin Gly Thr Cys Phe Asp Asp He Ser Gly 
805 810 815 

Tyr Thr Cys His Cys Val Leu Pro Tyr Thr Gly Lys Asn Cys Gin Thr 
820 825 830 

Val Leu Ala Pro Cys Ser Pro Asn Pro Cys Glu Asn Ala Ala Val Cys 
835 840 845 

Lys Glu Ser Pro Asn Phe Glu Ser Tyr Thr Cys Leu Cys Ala Pro Gly 
850 855 860 

Trp Gin Gly Gin Arg Cys Thr He Asp He Asp Glu Cys He Ser Lys 
865 870 875 880 

Pro Cys Met Asn His Gly Leu Cys His Asn Thr Gin Gly Ser Tyr Met 
885 890 895 

Cys Glu Cys Pro Pro Gly Phe Ser Gly Met Asp Cys Glu Glu Asp He 
900 905 * 910 

Asp Asp Cys Leu Ala Asn Pro Cys Gin Asn Gly Gly Ser Cys Met Asp 
915 920 925 

Gly Val Asn Thr Phe Ser Cys Leu Cys Leu Pro Gly Phe Thr Gly Asp 
930 935 940 

Lys Cys Gin Thr Asp Met Asn Glu Cys Leu Ser Glu Pro Cys Lys Asn 
945 950 955 960 

Gly Gly Thr Cys Ser Asp Tyr Val Asn Ser Tyr Thr Cys Lys Cys Gin 
965 970 975 

Ala Gly Phe Asp Gly Val His Cys Glu Asn Asn He Asn Glu Cys Thr 
980 985 990 

Glu Ser Ser Cys Phe Asn Gly Gly Thr Cys Val Asp Gly He Asn Ser 
995 1000 1005 



ID: <WO 9407474A1_I_> 



WO 94/07474 



PCT/US93/09338 



-124- 

Phe Ser Cys Leu Cys Pro Val Gly Phe Thr Gly Ser Phe Cys Leu His 
1010 1015 1020 

Glu lie Asn Glu Cys Ser Ser His Pro Cys Leu Asn Glu Gly Thr Cys 
1025 1030 1035 1040 

Val Asp Gly Leu Gly Thr Tyr Arg Cys Ser Cys Pro Leu Gly Tyr Thr 
1045 1050 1055 

Gly Lys Asn Cys Gin Thr Leu Val Asn Leu Cys Ser Arg Ser Pro Cys 
1060 1065 1070 

Lys Asn Lys Gly Thr Cys Val Gin Lys Lys Ala Glu Ser Gin Cys Leu 
1075 1080 1085 

Cys Pro Ser Gly Trp Ala Gly Ala Tyr Cys Asp Val Pro Asn Val Ser 
1090 1095 1100 

Cys Asp He Ala Ala Ser Arg Arg Gly Val Leu Val Glu His Leu Cys 
1105 1110 1115 1120 

Gin His Ser Gly Val Cys He Asn Ala Gly Asn Thr His Tyr Cys Gin 
1125 1130 1135 

Cys Pro Leu Gly Tyr Thr Gly Ser Tyr Cys Glu Glu Gin Leu Asp Glu 
1140 1145 1150 

Cys Ala Ser Asn Pro Cys Gin His Gly Ala Thr Cys Ser Asp Phe lie 
1155 1160 1165 

Gly Gly Tyr Arg Cys Glu Cys Val Pro Gly Tyr Gin Gly Val Asn Cys 
1170 - 1175 1180 

Glu Tyr Glu Val Asp Glu Cys Gin Asn Gin Pro Cys Gin Asn Gly Gly 
1185 1190 1195 1200 

Thr Cys lie Asp Leu Val Asn His Phe Lys Cys Ser Cys Pro Pro Gly 
1205 1210 1215 

Thr Arg Gly Leu Leu Cys Glu Glu Asn He Asp Asp Cys Ala Arg Gly 
1220 1225 1230 

Pro His Cys Leu Asn Gly Gly Gin Cys Met Asp Arg lie Gly Gly Tyr 
1235 1240 1245 

Ser Cys Arg Cys Leu Pro Gly Phe Ala Gly Glu Arg Cys Glu Gly Asp 
1250 1255 1260 

lie Asn Glu Cys Leu Ser Asn Pro Cys Ser Ser Glu Gly Ser Leu Asp 
1265 1270 1275 1280 

Cys He Gin Leu Thr Asn Asp Tyr Leu Cys Val Cys Arg Ser Ala Phe 
1285 1290 1295 

Thr Gly Arg His Cys Glu Thr Phe Val Asp Val Cys Pro Gin Met Pro 
1300 1305 1310 

Cys Leu Asn Gly Gly Thr Cys Ala Val Ala Ser Asn Met Pro Asp Gly 
1315 1320 1325 

Phe He Cys Arg Cys Pro Pro Gly Phe Ser Gly Ala Arg Cys Gin Ser 
1330 1335 1340 

Ser Cys Gly Gin Val Lys Cys Arg Lys Gly Glu Gin Cys Val His Thr 
1345 1350 1355 1360 

Ala Ser Gly Pro Arg Cys Phe Cys Pro Ser Pro Arg Asp Cys Glu Ser 
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1365 1370 1375 

Gly Cys Ala Ser Ser Pro Cys Gin His Gly Gly Ser Cys His Pro Gin 
1380 1385 " 1390 

Arg Gin Pro Pro Tyr Tyr Ser Cys Gin Cys Ala Pro Pro Phe Ser Gly 
1395 1400 1405 

Ser Arg Cys Glu Leu Tyr Thr Ala Pro Pro Ser Thr Pro Pro Ala Thr 
1410 1415 1420 

Cys Leu Ser Gin Tyr Cys Ala Asp Lys Ala Arg Asp Gly Val Cys Asp 
1425 1430 1435 1440 

Glu Ala Cys Asn Ser His Ala Cys Gin Trp Asp Gly Gly Asp Cys Ser 
1445 1450 * 1455 

Leu Thr Met Glu Asn Pro Trp Ala Asn Cys Ser Ser Pro Leu Pro Cys 
1460 1465 1470 

Trp Asp Tyr lie Asn Asn Gin Cys Asp Glu Leu Cys Asn Thr Val Glu 
1475 1480 1485 

Cys Leu Phe Asp Asn Phe Glu Cys Gin Gly Asn Ser Lys Thr Cys Lys 
1490 1495 1500 

Tyr Asp Lys Tyr Cys Ala Asp His Phe Lys Asp Asn His Cys Asn Gin 
1505 1510 1515 1520 

Gly Cys Asn Ser Glu Glu Cys Gly Trp Asp Gly Leu Asp Cys Ala Ala 
1525 1530 ' 1535 

Asp Gin Pro Glu Asn Leu Ala Glu Gly Thr Leu Val lie Val Val Leu 
1540 1545 1550 

Met Pro Pro Glu Gin Leu Leu Gin Asp Ala Arg Ser Phe Leu Arg Ala 
1555 1560 1565 

Leu Gly Thr Leu Leu His Thr Asn Leu Arg lie Lys Arg Asp Ser Gin 
1570 1575 1580 

Gly Glu Leu Met Val Tyr Pro Tyr Tyr Gly Glu Lys Ser Ala Ala Met 
1585 1590 1595 1600 

Lys Lys Gin Arg Met Thr Arg Arg Ser Leu Pro Gly Glu Gin Glu Gin 
1605 1610 1615 

Glu Val Ala Gly Ser Lys Val Phe Leu Glu lie Asp Asn Arg Gin Cys 
1620 1625 1630 

Val Gin Asp Ser Asp His Cys Phe Lys Asn Thr Asp Ala Ala Ala Ala 
1635 1640 1645 

Leu Leu Ala Ser His Ala lie Gin Gly Thr Leu Ser Tyr Pro Leu Val 
1650 1655 1660 

Ser Val Val Ser Glu Ser Leu Thr Pro Glu Arg Thr Gin Leu Leu Tyr 
1665 1670 1675 1680 

Leu Leu Ala Val Ala Val Val lie lie Leu Phe lie He Leu Leu Gly 
1685 1690 1695 

Val He Met Ala Lys Arg Lys Arg Lys His Gly Ser Leu Trp Leu Pro 
1700 1705 1710 

Glu Gly Phe Thr Leu Arg Arg Asp Ala Ser Asn His Lys Arg Arg Glu 
1715 1720 1725 
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Pro Val Gly Gin Asp Ala Val Gly Leu Lys Asn LeuSer Val Gin Val 
1730 1735 1740 

Ser Glu Ala Asn Leu He Gly Thr Gly Thr Ser Glu His Trp Val Asp 
1745 1750 1755 1760 

Asp Glu Gly Pro Gin Pro Lys Lys Val Lys Ala Glu Asp Glu Ala Leu 
1765 1770 1775 

Leu Ser Glu Glu Asp Asp Pro He Asp Arg Arg Pro Trp Thr Gin Gin 
1780 1785 179U 

His Leu Glu Ala Ala Asp lie Arg Arg Thr Pro Ser Leu Ala Leu Thr 
1795 1800 1805 

Pro Pro Gin Ala Glu Gin Glu Val Asp Val Leu Asp Val Asn Val Arg 
1810 1815 1820 

Gly Pro Asp Gly Cys Thr Pro Leu Met Leu Ala Ser Leu Arg Gly Gly 
1825 1830 1835 1»4G 

Ser Ser Asp Leu Ser Asp Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala 
1845 1850 1855 

Asn He He Thr Asp Leu Val Tyr Gin Gly Ala Ser Leu Gin Ala Gin 
I860 1865 XBVO 

Thr Asp Arg Thr Gly Glu Met Ala Leu His Leu Ala Ala Arg Tyr Ser 
1875 18 80 1885 

Arg Ala Asp Ala Ala Lys Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn 
1890 1895 1900 

Ala Gin Asp Asn Met Gly Arg Cys Pro Leu His Ala Ala Val Ala Ala 
1905 1910 1915 1920 

Asp Ala Gin Gly Val Phe Gin He Leu He Arg Asn Arg Val Thr Asp 
1925 1930 1935 

Leu Asp Ala Arg Met Asn Asp Gly Thr Thr Pro Leu He Leu Ala Ala 
1940 1945 1950 

Arg Leu Ala Val Glu Gly Met Val Ala Glu Leu He Asn Cys Gin Ala 
1955 I960 1965 

Asp Val Asn Ala Val Asp Asp His Gly Lys Ser Ala Leu His Trp Ala 
1970 1975 1980 

Ala Ala Val Asn Asn Val Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly 
1985 1990 1995 2000 

Ala Asn Arg Asp Met Gin Asp Asn Lys Glu Glu Thr Pro Leu Phe_Leu 
2005 2010 2015 

Ala Ala Arg Glu Gly Ser Tyr Glu Ala Ala Lys He Leu Leu Asp His 
2020 2025 2030 

Phe Ala Asn Arg Asp lie Thr Asp His Met Asp Arg Leu Pro Arg Asp 
2035 2040 2045 

Val Ala Arg Asp Arg Met His His Asp He Val Arg Leu Leu Asp Glu 
2050 2055 2060 

Tyr Asn Val Thr Pro Ser Pro Pro Gly Thr Val Leu Thr Ser Ala Leu 
2065 2070 2075 2080 

Ser Pro Val He Cys Gly Pro Asn Arg Ser Phe Leu Ser Leu Lys His 
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2085 2090 2095 

Thr Pro Met Gly Lys Lys Ser Arg Arg Pro Ser Ala Lys Ser Thr Met 
2100 2105 2110 

Pro Thr Ser Leu Pro Asn Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly 
2115 2120 2125 

Ser Arg Arg Lys Lys Ser Leu Ser Glu Lys Val Gin Leu Ser Glu Ser 
2130 2135 2140 

Ser Val Thr Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His Thr Tyr 
2145 2150 2155 2160 

Val Ser Asp Thr Thr Ser Ser Pro Met He Thr Ser Pro Gly He Leu 
2165 2170 2175 

Gin Ala Ser Pro Asn Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro 
2180 • 2185 2190 

Val His Ala Gin His Ala Leu Ser Phe Ser Asn Leu His Glu Met Gin 
2195 2200 2205 

Pro Leu Ala His Gly Ala Ser Thr Val Leu Pro Ser Val Ser Gin Leu 
2210 2215 2220 

Leu Ser His His His He Val Ser Pro Gly Ser Gly Ser Ala Gly Ser 
2225 2230 2235 2240 

Leu Ser Arg Leu His Pro Val Pro Val Pro Ala Asp Trp Met Asn Arg 
2245 2250 2255 

Met Glu Val Asn Glu Thr Gin Tyr Asn Glu Met Phe Gly Met Val Leu 
2260 2265 2270 

Ala Pro Ala Glu Gly Thr His Pro Gly lie Ala Pro Gin Ser Arg Pro 
2275 2280 2285 

Pro Glu Gly Lys His He Thr Thr Pro Arg Glu Pro Leu Pro Pro He 
2290 2295 2300 

Val Thr Phe Gin Leu He Pro Lys Gly Ser He Ala Gin Pro Ala Gly 
2305 2310 2315 2320 

Ala Pro Gin Pro Gin Ser Thr Cys Pro Pro Ala Val Ala Gly Pro Leu 
2325 2330 2335 

Pro Thr Met Tyr Gin He Pro Glu Met Ala Arg Leu Pro Ser Val Ala 
2340 2345 2350 

Phe Pro Thr Ala Met Met Pro Gin Gin Asp Gly Gin Val Ala Gin Thr 
2355 2360 2365 

He Leu Pro Ala Tyr His Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro 
2370 2375 2380 

Thr Pro Pro Ser Gin His Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg 
2385 2390 2395 2400 

Thr Pro Ser His Ser Gly His Leu Gin Gly Glu His Pro Tyr Leu Thr 
2405 2410 2415 

Pro Ser Pro Glu Ser Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser 
2420 2425 2430 

Ala Ser Asp Trp Ser Asp Val Thr Thr Ser Pro Thr Pro Gly Gly Ala 
2435 2440 2445 
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Gly Gly Gly Gin Arg Gly Pro Gly Thr His Met Ser Glu Pro Pro His 
2450 2455 2460 

Asn Asn Met Gin Val Tyr Ala 
2465 2470 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2556 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Met Pro Pro Leu Leu Ala Pro Leu Leu Cys Leu Ala Leu Leu Pro Ala 
1 5 10 15 

Leu Ala Ala Arg Gly Pro Arg Cys Ser Gin Pro Gly Glu Thr Cys Leu 
20 25 30 

Asn Gly Gly Lys Cys Glu Ala Ala Asn Gly Thr Glu Ala Cys Val Cys 
35 40 45 

Gly Gly Ala Phe Val Gly Pro Arg Cys Gin Asp Pro Asn Pro Cys Leu 
50 55 60 

Ser Thr Pro Cys Lys Asn Ala Gly Thr Cys His Val Val Asp Arg Arg 
65 70 75 80 

Gly Val Ala Asp Tyr Ala Cys Ser Cys Ala Leu Gly Phe Ser Gly Pro 
85 90 « 

Leu Cys Leu Thr Pro Leu Asp Asn Ala Cys Leu Thr Asn Pro Cys Arg 
100 105 HO 

Asn Gly Gly Thr Cys Asp Leu Leu Thr Leu Thr Glu Tyr Lys Cys Arg 
115 120 125 

Cys Pro Pro Gly Trp Ser Gly Lys Ser Cys Gin Gin Ala Asp Pro Cys 
130 ~ 135 140 

Ala Ser Asn Pro Cys Ala Asn Gly Gly Gin Cys Leu Pro Phe Glu Ala 
145 150 155 160 

Ser Tyr lie Cys His Cys Pro Pro Ser Phe His Gly Pro Thr Cys Arg 
165 170 175 

Gin Asp val Asn Glu Cys Gly Gin Lys Pro Arg Leu Cys Arg His Gly 
180 1 Q5 190 

Gly Thr Cys His Asn Glu Val Gly Ser Tyr Arg Cys Val Cys Arg Ala 
195 200 205 

Thr His Thr Gly Pro Asn Cys Glu Arg Pro Tyr Val Pro Cys Ser Pro 
210 215 220 

Ser Pro Cys Gin Asn Gly Gly Thr Cys Arg Pro Thr Gly Asp Val Thr 
225 230 235 240 

His Glu Cys Ala Cys Leu Pro Gly Phe Thr Gly Gin Asn Cys Glu Glu 
245 250 255 
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Asn lie Asp Asp Cys Pre Gly Asn Asn Cys Lys Asn Gly Gly Ala Cys 
260 265 270 

Val Asp Gly Val Asn Thr Tyr Asn Cys Pro Cys Pro Pro Glu Trp Thr 
275 280 285 

Gly Gin Tyr Cys .Thr Glu Asp Val Asp Glu Cys Gin Leu Met Pro Asn 
290 295 300 

Ala Cys Gin Asn Gly Gly Thr Cys His Asn Thr His Gly Gly Tyr Asn 
305 310 315 320 

Cys Val Cys Val Asn Gly Trp Thr Gly Glu Asp Cys Ser Glu Asn lie 
325 330 335 

Asp Asp Cys Ala Ser Ala Ala Cys Phe His Gly Ala Thr Cys His Asp 
340 345 350 

Arg Val Ala Ser Phe Tyr Cys Glu Cys Pro His Gly Arg Thr Gly Leu 
355 360 365 

Leu Cys His Leu Asn Asp Ala Cys lie Ser Asn Pro Cys Asn Glu Gly 
370 375 380 

Ser Asn Cys Asp Thr Asn Pro Val Asn Gly Lys Ala lie Cys Thr Cys 
385 390 395 400 

Pro Ser Gly Tyr Thr Gly Pro Ala Cys Ser Gin Asp Val Asp Glu Cys 
405 410 415 

Ser Leu Gly Ala Asn Pro Cys Glu His Ala Gly Lys Cys lie Asn Thr 
420 425 430 

Leu Gly Ser Phe Glu Cys Gin Cys Leu Gin Gly Tyr Thr Gly Pro Arg 
435 440 445 

Cys Glu lie Asp Val Asn Glu Cys Val Ser Asn Pro Cys Gin Asn Asp 
450 455 460 

Ala Thr Cys Leu Asp Gin lie Gly Glu Phe Gin Cys Met Cys Met Pro 
465 470 475 480 

Gly Tyr Glu Gly Val His Cys Glu Val Asn Thr Asp Glu Cys Ala Ser 
485 490 495 

Ser Pro Cys Leu His Asn Gly Arg Cys Leu Asp Lys lie Asn Glu Phe 
500 505 510 

Gin Cys Glu Cys Pro Thr Gly Phe Thr Gly His Leu Cys Gin Tyr Asp 
515 520 525 

Val Asp Glu Cys Ala Ser Thr Pro Cys Lys Asn Gly Ala Lys Cys Leu 
530 535 540 

Asp Gly Pro Asn Thr Tyr Thr Cys Val Cys Thr Glu Gly Tyr Thr Gly 
545 550 555 560 

Thr His Cys Glu Val Asp lie Asp Glu Cys Asp Pro Asp Pro Cys His 
565 570 575 

Tyr Gly Ser Cys Lys Asp Gly Val Ala Thr Phe Thr Cys Leu Cys Arg 
580 585 590 

Pro Gly Tyr Thr Gly His His Cys Glu Thr Asn lie Asn Glu Cys Ser 
595 600 605 

Ser Gin Pro Cys Arg Leu Arg Gly Thr Cys Gin Asp Pro Asp Asn Ala 



ID: <WO 9407474A1J_: 



WO 94/07474 



PCI7US93/09338 



-130- 

610 615 620 

Tvr Leu Cvs Phe Cys Leu Lys Gly Thr Thr Gly Pro Asn Cys Glu lie 
625 630 635 640 

Asn Leu Asp Asp Cys Ala Ser Ser Pro Cys Asp Ser Gly Thr Cys Leu 
645 650 655 

Asp Lys lie Asp Gly Tyr Glu cys Ala Cys Glu Pro Gly Tyr Thr Gly 
660 665 670 

Ser Met Cys Asn Ser Asn He Asp Glu Cys Ala Gly Asn Pro Cys His 
675 680 685 

Asn Gly Gly Thr Cys Glu Asp Gly He Asn Gly Phe Thr Cys Arg Cys 
690 695 700 

Pro Glu Gly Tyr His Asp Pro Thr Cys Leu Ser Glu Val Asn Glu Cys 
705 710 715 720 

Asn Ser Asn Pro Cys Val His Gly Ala Cys Arg Asp Ser Leu Asn Gly 
725 730 735 

Tyr Lys Cys Asp Cys Asp Pro Gly Trp Ser Gly Thr Asn Cys Asp He 
740 745 750 

Asn Asn Asn Glu Cys Glu Ser Asn Pro Cys Val Asn Gly Gly Thr Cys 
755 760 765 

Lys Asp Met Thr Ser Gly He Val Cys Thr Cys Arg Glu Gly Phe Ser 
770 775 780 

Gly Pro Asn Cys Gin Thr Asn He Asn Glu Cys Ala Ser Asn Pro Cys 
785 790 795 800 

Leu Asn Lys Gly Thr Cys He Asp Asp Val Ala Gly Tyr Lys Cys Asn 
805 810 815 

Cys Leu Leu Pro Tyr Thr Gly Ala Thr Cys Glu Val Val Leu Ala Pro 
820 825 830 

Cys Ala Pro Ser Pro Cys Arg Asn Gly Gly Glu Cys Arg Gin Ser Glu 
835 840 845 

Asp Tyr Glu Ser Phe Ser Cys Val Cys Pro Thr Ala Gly Ala Lys Gly 
850 855 860 

Gin Thr Cys Glu Val Asp He Asn Glu Cys Val Leu Ser Pro Cys Arg 
865 870 87 5 880 

His Gly Ala Ser Cys Gin Asn Thr His Gly Gly Tyr Arg Cys His Cys 
885 890 895 

Gin Ala Gly Tyr Ser Gly Arg Asn Cys Glu Thr Asp He Asp Asp Cys 
900 905 910 

Arg Pro Asn Pro Cys His Asn Gly Gly Ser Cys Thr Asp Gly He Asn 
915 920 925 

Thr Ala Phe Cys Asp Cys Leu Pro Gly Phe Arg Gly Thr Phe Cys Glu 
930 " 935 940 

Glu Asp He Asn Glu Cys Ala Ser Asp Pro Cys Arg Asn Gly Ala Asn 
945 * 950 955 960 

Cvs Thr Asp Cys Val Asp Ser Tyr Thr Cys Thr Cys Pro Ala Gly Phe 
1 965 970 975 
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Ser Gly lie His Cys Glu Asn Asn Thr Pro Asp Cys Thr Glu Ser Ser 
980 985 990 

Cys Phe Asn Gly Gly Thr Cys Val Asp Gly lie Asn ser Phe Thr Cys 
995 * 1000 1005 

Leu Cys Pro Pro Gly Phe Thr Gly Ser Tyr Cys Gin His Val Val Asn 
1010 1015 1020 

Glu Cys Asp Ser Arg Pro Cys Leu Leu Gly Gly Thr Cys Gin Asp Gly 
1025 1030 1035 1040 

Arg Gly Leu His Arg Cys Thr Cys Pro Gin Gly Tyr Thr Gly Pro Asn 
1045 1050 1055 

Cys Gin Asn Leu Val His Trp Cys Asp Ser Ser Pro Cys Lys Asn Gly 
1060 1065 1070 

Gly Lys Cys Trp Gin Thr His Thr Gin Tyr Arg Cys Glu Cys Pro Ser 
1075 1080 1085 

Gly Trp Thr Gly Leu Tyr Cys Asp Val Pro Ser Val Ser Cys Glu Val 
1090 " 1095 1100 

Ala Ala Gin Arg Gin Gly Val Asp Val Ala Arg Leu Cys Gin His Gly 
1105 1110 1115 1120 

Gly Leu Cys Val Asp Ala Gly Asn Thr His His Cys Arg Cys Gin Ala 
1125 ~ 1130 1135 

Gly Tyr Thr Gly Ser Tyr Cys Glu Asp Leu Val Asp Glu Cys Ser Pro 
1140 1145 1150 

Ser Pro Cys Gin Asn Gly Ala Thr Cys Thr Asp Tyr Leu Gly Gly Tyr 
1155 1160 1165 

Ser Cys Lys Cys Val Ala Gly Tyr His Gly Val Asn Cys Ser Glu Glu 
1170 v 1175 1180 

lie Asp Glu Cys Leu Ser His Pro Cys Gin Asn Gly Gly Thr Cys Leu 
1185 1190 1195 1200 

Asp Leu Pro Asn Thr Tyr Lys Cys Ser Cys Pro Arg Gly Thr Gin Gly 
1205 1210 1215 

Val His Cys Glu lie Asn Val Asp Asp Cys Asn Pro Pro Val Asp Pro 
1220 1225 1230 

Val Ser Arg Ser Pro Lys Cys Phe Asn Asn Gly Thr Cys Val Asp Gin 
1235 1240 ^ 1245 

Val Gly Gly Tyr Ser Cys Thr Cys Pro Pro Gly Phe Val Gly Glu Arg 
1250 1255 1260 

Cys Glu Gly Asp Val Asn Glu Cys Leu Ser Asn Pro Cys Asp Ala Arg 
1265 ' 1270 1275 1280 

Gly Thr Gin Asn Cys Val Gin Arg Val Asn Asp Phe His Cys Glu Cys 
1285 1290 1295 

Arg Ala Gly His Thr Gly Arg Arg Cys Glu Ser Val lie Asn Gly Cys 
1300 1305 1310 

Lys Gly Lys Pro Cys Lys Asn Gly Gly Thr Cys Ala Val Ala Ser Asn 
1315 1320 1325 

Thr Ala Arg Gly Phe lie Cys Lys Cys Pro Ala Gly Phe Glu Gly Ala 
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1330 1335 "40 

Thr Cys Glu Asn Asp Ala Arg Thr Cys Gly Ser Leu Arg Cy S Leu A.n 
1345 1350 1JS * 

Gly Gly Thr Cys lie Ser Gly Pro Arg Ser Pro Thr Cys Leu Cys Leu 
1365 1370 

Gly Pro Phe Thr Gly Pro Glu Cys Gin Phe Pro Ala Ser Ser Pro Cys 

* 1380 1385 1390 

Leu Gly Gly Asn Pro Cys Tyr Asn Gin Gly Thr Cys Glu Pro Thr Ser 
1395 1400 

Glu Ser Pro Phe Tyr Arg Cys Leu Cys Pro Ala Lys Phe Asn Gly Leu 
1410 1415 1420 

Leu Cys His He Leu Asp Tyr Ser Phe Gly Gly Gly Ala Gly Arg Asp 
1425 1430 1435 

He Pro Pro Pro Leu He Glu Glu Ala Cys Glu Leu Pro Glu Cys Gin 
1445 1450 

Glu Asp Ala Gly Asn Lys Val Cys Ser Leu Gin Cys Asn Asn His Ala 
1460 1465 ,i*/u 

Cys Gly Trp Asp Gly Gly Asp Cys Ser Leu Asn Phe Asn Asp Pro Trp 
1475 1480 1485 

Lys Asn Cys Thr Gin Ser Leu Gin Cys Trp Lys Tyr Phe Ser Asp Gly 
1490 1495 1500 

His Cys Asp Ser Gin Cys Asn Ser Ala Gly Cys Leu Phe Asp Gly Phe 
1505 1510 1515 15ZO 

Asp Cys Gin Arg Ala Glu Gly Gin Cys Asn Pro Leu Tyr Asp Gin Tyr 

* 1525 1530 1535 

Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gin Gly Cys Asn Ser 
1540 1545 1550 

Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala Glu His Val Pro Glu 
1555 1560 1565 

Arg Leu Ala Ala Gly Thr Leu Val Val Val Val Leu Met Pro Pro Glu 
1570 1575 1580 

Gin Leu Arg Asn ser Ser Phe His Phe Leu Arg Glu Leu Ser Arg Val 
1585 1590 1595 1600 

Leu His Thr Asn Val Val Phe Lys Arg Asp Ala His Gly Gin Gin Met 
1605 1510 l 6i5 

He Phe Pro Tyr Tyr Gly Arg Glu Glu Glu Leu Arg Lys His Pro He 
1620 1625 1630 

Lys Arg Ala Ala Glu Gly Trp Ala Ala Pro Asp Ala Leu Leu Gly Gin 
1635 1540 1545 

Val Lys Ala Ser Leu Leu Pro Gly Gly Ser Glu Gly Gly Arg Arg Arg 
1650 1655 1660 

Arg Glu Leu Asp Pro Met Asp Val Arg Gly Ser He Val Tyr Leu Glu 
1665 1670 1675 16SO 

He Asp Asn Arg Gin Cys Val Gin Ala Ser Ser Gin Cys Phe Gin Ser 
1685 1690 1695 
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Ala Thr Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser Leu Gly Ser 
1700 1705 1710 

Leu Asn lie Pro Tyr Lys lie Glu Ala Val Gin Ser Glu Thr Val Glu 
1715 1720 1725 

Pro Pro Pro Pro Ala Gin Leu His Phe Met Tyr Val Ala Ala Ala Ala 
1730 1735 1740 

Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser Arg Lys 
1745 1750 1755 1760 

Arg Arg Arg Gin His Gly Gin Leu Trp Phe Pro Glu Gly Phe Lys Val 
1765 1770 1775 

Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Glu Leu Gly Glu Asp Ser 
1780 1785 1790 

Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp Gly Ala Leu Met Asp 
1795 1800 1805 

Asp Asn Gin Asn Glu Trp Gly Asp Glu Asp Leu Glu Thr Lys Lys Phe 
1810,. 1815 1820 

Arg Phe Glu Glu Pro Val Val Leu Pro Asp Leu Asp Asp Gin Thr Asp 
1825 1830 1835 1840 

His Arg Gin Trp Thr Gin Gin His Leu Asp Ala Ala Asp Leu Arg Met 
1845 1850 1855 

Ser Ala Met Ala Pro Thr Pro Pro Gin Gly Glu Val Asp Ala Asp Cys 
1860 1865 1870 

Met Asp Val Asn Val Arg Gly Pro Asp Gly Phe Thr Pro Leu Met lie 
1875 1880 1885 

Ala Ser Cys Ser Gly Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu 
1890 1895 1900 

Glu Asp Ala Pro Ala Val lie Ser Asp Phe lie Tyr Gin Gly Ala Ser 
1905 1910 ~ 1915 1920 

Leu His Asn Gin Thr Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala 
1925 1930 1935 

Ala Arg Tyr Ser Arg Ser Asp Ala Ala Lys Arg Leu Leu Glu Ala Ser 
1940 1945 1950 

Ala Asp Ala Asn He Gin Asp Asn Met Gly Arg Thr Pro Leu His Ala 
1955 1960 1965 

Ala Val Ser Ala Asp Ala Gin Gly Val Phe Gin lie Leu He Arg Asn 
1970 1975 1980 

Arg Ala Thr Asp Leu Asp Ala Arg Met His Asp Gly Thr Thr Pro Leu 
1985 1990 1995 2000 

He Leu Ala Ala Arg Leu Ala Val Glu Gly Met Leu Glu Asp Leu He 
20O5 2010 2015 

Asn Ser His Ala Asp Val Asn Ala Val Asp Asp Leu Gly Lys Ser Ala 
2020 2025 2030 

Leu His Trp Ala Ala Ala Val Asn Asn Val Asp Ala Ala Val Val Leu 
2035 2040 2045 

Leu Lys Asn Gly Ala Asn Lys Asp Met Gin Asn Asn Arg Glu Glu Thr 
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2050 2055 2060 

Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val 
2065 2070 2075 2080 

Leu Leu Asp His Phe Ala Asn Arg Asp He Thr Asp His Met Asp Arg 
2085 2090 2095 

Leu Pro Arg Asp lie Ala Gin Glu Arg Met His His Asp He Val Arg 
2100 2105 2110 

Leu Leu Asp Glu Tyr Asn Leu Val Arg Ser Pro Gin Leu His Gly Ala 
2115 2120 2125 

Pro Leu Gly Gly Thr Pro Thr Leu Ser Pro Pro Leu Cys Ser Pro Asn 
2130 2135 2140 

Gly Tyr Leu Gly Ser Leu Lys Pro Gly Val Gin Gly Lys Lys Val Arg 
2145 2150 2155 2160 

Lys Pro Ser Ser Lys Gly Leu Ala Cys Gly Ser Lys Glu Ala Lys Asp 
2165 2170 2175 

Leu Lys Ala Arg Arg Lys Lys Ser Gin Asp Gly Lys Gly Cys Leu Leu 
2180 2185 2190 

Asp Ser Ser Gly Met Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His 
2195 2200 2205 

Gly Tyr Leu Ser Asp Val Ala Ser Pro Pro Leu Leu Pro Ser Pro Phe 
2210 2215 2220 

Gin Gin Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met Pro Asp 
2225 2230 2235 2240 

Thr His Leu Gly lie Gly His Leu Asn Val Ala Ala Lys Pro Glu Met 
2245 225Q 2255 

Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu Thr Gly Pro Pro 
2260 2265 2270 

Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr Ser Thr Val Leu Gly 
2275 2280 2285 

Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr Val Gly Gly Ser Thr Ser 
2290 2295 2300 

Leu Asn Gly Gin Cys Glu Trp Leu Ser Arg Leu Gin Ser Gly Met Val 
2305 ~ 2310 2315 2320 

Pro Asn Gin Tyr Asn Pro Leu Arg Gly Ser Val Ala Pro Gly Pro Leu 
2325 2330 2335 

Ser Thr Gin Ala Pro Ser Leu Gin His Gly Met Val Gly Pro Leu His 
2340 2345 2350 

Ser Ser Leu Ala Ala Ser Ala Leu Ser Gin Met Met Ser Tyr Gin Gly 
2355 2360 2365 

Leu Pro Ser Thr Arg Leu Ala Thr Gin Pro His Leu Val Gin Thr Gin 
2370 ~ 2375 2380 

Gin Val Gin Pro Gin Asn Leu Gin Met Gin Gin Gin Asn Leu Gin Pro 
2385 2390 2395 2400 

Ala Asn He Gin Gin Gin Gin Ser Leu Gin Pro Pro Pro Pro Pro Pro 
2405 2410 2415 
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Gln Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu Gly Arg 
2420 2425 2430 

Ser Phe Leu Ser Gly Glu Pro Ser Gin Ala Asp Val Gin Pro Leu Gly 
2435 2440 2445 

Pro Ser Ser Leu Ala Val His Thr lie Leu Pro Gin Glu Ser Pro Ala 
2450 2455 2460 

Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr Ala Ala 
2465 2470 2475 2480 

Gin Phe Leu Thr Pro Pro Ser Gin His Ser Tyr Ser Ser Pro Val Glu 
2485 2490 2495 

Asn Thr Pro Ser His Gin Leu Gin Val Pro Glu His Pro Phe Leu Thr 
2500 2505 2510 

Pro Ser Pro Glu Ser Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser 
2515 2520 2525 

Asn Val Ser Asp Trp Ser Glu Gly Val Ser Ser Pro Pro Thr Ser Met 
2530 2535 2540 

Gin Ser Gin He Ala Arg lie Pro Glu Ala Phe Lys 
2545 2550 2555 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9723 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 10.. 7419 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

GGAATTCCG CCC GCC CTG CGC CCC GCT CTG CTG TGG GCG CTG CTG GCG 48 
Pro Ala Leu Arg Pro Ala Leu Leu Trp Ala Leu Leu Ala 
1 5 10 

CTC TGG CTG TGC TGC GCG GCC CCC GCG CAT GCA TTG CAG TGT CGA GAT 96 
Leu Trp Leu Cys Cys Ala Ala Pro Ala His Ala Leu Gin Cys Arg Asp 
15 20 25 

GGC TAT GAA CCC TGT GTA AAT GAA GGA ATG TGT GTT ACC TAC CAC AAT 144 
Gly Tyr Glu Pro Cys Val Asn Glu Gly Met Cys Val Thr Tyr His Asn 
36 * 35 40 45 

GGC ACA GGA TAC TGC AAA TGT CCA GAA GGC TTC TTG GGG GAA TAT TGT 192 
Gly Thr Gly Tyr Cys Lys Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys . 

50 55 60 

CAA CAT CGA GAC CCC TGT GAG AAG AAC CGC TGC CAG AAT GGT GGG ACT 240 
Gin His Arg Asp Pro Cys Glu Lys Asn Arg Cys Gin Asn Gly Gly Thr 
65 70 75 

TGT GTG GCC CAG GCC ATG CTG GGG AAA GCC ACG TGC CGA TGT GCC TCA 288 
Cys Val Ala Gin Ala Met Leu Gly Lys Ala Thr Cys Arg Cys Ala Ser 
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80 85 90 

GGG TTT ACA GGA GAG GAC TGC CAG TAC TCA ACA TCT CAT CCA TGC TTT 336 
Gly Phe Thr Gly Glu Asp Cys Gin Tyr Ser Thr Ser His Pro Cys Phe 
95 100 105 

GTG TCT CGA CCC TGC CTG AAT GGC GGC ACA TGC CAT ATG CTC AGC CGG 384 
Val Ser Arg Pro Cys Leu Asn Gly Gly Thr Cys His Met Leu Ser Arg 
.110 115 120 125 

GAT ACC TAT , GAG TGC ACC TGT CAA GTC GGG TTT ACA GGT AAG GAG TGC 432 
Asp Thr Tyr Glu Cys Thr Cys Gin Val Gly Phe Thr Gly Lys Glu Cys 
130 135 140 

CAA TGG ACG GAT GCC TGC CTG TCT CAT CCC TGT GCA AAT GGA AGT ACC 480 
Gin Trp Thr Asp Ala Cys Leu Ser His Pro Cys Ala Asn Gly Ser Thr 
145 150 1 155 

TGT ACC ACT GTG GCC AAC CAG TTC TCC TGC AAA TGC CTC ACA GGC TTC 528 
Cys Thr Thr Val Ala Asn Gin Phe Ser Cys Lys Cys Leu Thr Gly Phe 
160 165 170 

ACA GGG CAG AAA TGT GAG ACT GAT GTC AAT GAG TGT GAC ATT CCA GGA 576 
Thr Gly Gin Lys Cys Glu Thr Asp Val Asn Glu Cys Asp lie Pro Gly 
175 180 185 

CAC TGC CAG CAT GGT GGC ACC TGC CTC AAC CTG CCT GGT TCC TAC CAG 624 
His Cys Gin His Gly Gly Thr Cys Leu Asn Leu Pro Gly Ser Tyr Gin 
190 195 200 205 

TGC CAG TGC CCT CAG GGC TTC ACA GGC CAG TAC TGT GAC AGC CTG TAT 672 
Cys Gin Cys Pro Gin Gly Phe Thr Gly Gin Tyr Cys Asp Ser Leu Tyr 
210 215 220 

GTG CCC TGT GCA CCC TCA CCT TGT GTC AAT GGA GGC ACC TGT CGG CAG 720 
Val Pro Cys Ala Pro Ser Pro Cys Val Asn Gly Gly Thr Cys Arg Gin 
225 230 235 

ACT GGT GAC TTC ACT TTT GAG TGC AAC TGC CTT CCA GGT TTT GAA GGG 768 
Thr Gly Asp Phe Thr Phe Glu Cys Asn Cys Leu Pro Gly Phe Glu Gly 
240 245 250 

AGC ACC TGT GAG AGG AAT ATT GAT GAC TGC CCT AAC CAC AGG TGT CAG 816 
Ser Thr Cys Glu Arg Asn lie Asp Asp Cys Pro Asn His Arg Cys Gin 
255 260 265 

AAT GGA GGG GTT TGT GTG GAT GGG GTC AAC ACT TAC AAC TGC CGC TGT 864 
Asn Gly Gly Val Cys Val Asp Gly Val Asn Thr Tyr Asn Cys Arg Cys 
270 275 280 285 

CCC CCA CAA TGG ACA GGA CAG TTC TGC ACA GAG GAT GTG GAT GAA TGC 912 
Pro Pro Gin Trp Thr Gly Gin Phe Cys Thr Glu Asp Val Asp Glu Cys 
290 295 300 

CTG CTG CAG CCC AAT GCC TGT CAA AAT GGG GGC ACC TGT GCC AAC CGC 960 
Leu Leu Gin Pro Asn Ala Cys Gin Asn Gly Gly Thr Cys Ala Asn Arg 
305 310 315 

AAT GGA GGC TAT GGC TGT GTA TGT GTC AAC GGC TGG AGT GGA GAT GAC 1008 
Asn Gly Gly Tyr Gly Cys Val Cys Val Asn Gly Trp Ser Gly Asp Asp 
320 325 330 

TGC AGT GAG AAC ATT GAT GAT TGT GCC TTC GCC TCC TGT ACT CCA GGC 1056 
Cys Ser Glu Asn He Asp Asp Cys Ala Phe Ala Ser Cys Thr Pro Gly 
335 340 345 

TCC ACC TGC ATC GAC CGT GTG GCC TCC TTC TCT TGC ATG TGC CCA GAG 1104 
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Ser Thr Cys lie Asp Arg Val Ala Ser Phe Ser Cys Met Cys Pro Glu 
350 355 360 365 

GGG AAG GCA GGT CTC CTG TGT CAT CTG GAT GAT GCA TGC ATC AGC AAT 1152 
Gly Lys Ala Gly Leu Leu Cys His Leu Asp Asp Ala Cys lie Ser Asn 
370 375 380 

CCT TGC CAC AAG GGG GCA CTG TGT GAC ACC AAC CCC CTA AAT GGG CAA 1200 
Pro Cys His Lys Gly Ala Leu Cys Asp Thr Asn Pro Leu Asn Gly Gin 
385 390 395 

TAT ATT TGC ACC TGC CCA CAA GGC TAC AAA GGG GCT GAC TGC ACA GAA 1248 
Tyr lie Cys Thr Cys Pro Gin Gly Tyr Lys Gly Ala Asp Cys Thr Glu 
400 405 410 

GAT GTG GAT GAA TGT GCC ATG GCC AAT AGC AAT CCT TGT GAG CAT GCA 1296 
Asp Val Asp Glu Cys Ala Met Ala Asn Ser Asn Pro Cys Glu His Ala 
415 420 425 



GGA AAA TGT GTG AAC ACG GAT GGC GCC TTC CAC TGT GAG TGT CTG AAG 
Gly Lys Cys Val Asn Thr Asp Gly Ala Phe His Cys Glu Cys Leu Lys 
430 435 440 445 



1344 



GGT TAT GCA GGA CCT CGT TGT GAG ATG GAC ATC AAT GAG TGC CAT TCA 
Gly Tyr Ala Gly Pro Arg Cys Glu Met Asp lie Asn Glu Cys His Ser 
450 455 460 



1392 



GAC CCC TGC CAG AAT GAT GCT ACC TGT CTG GAT AAG ATT GGA GGC TTC 
Asp Pro Cys Gin Asn Asp Ala Thr Cys Leu Asp Lys lie Gly Gly Phe 
465 470 475 



1440 



ACA TGT CTG TGC ATG CCA GGT TTC AAA GGT GTG CAT TGT GAA TTA GAA 
Thr Cys Leu Cys Met Pro Gly Phe Lys Gly Val His Cys Glu Leu Glu 
480 485 490 



1488 



ATA AAT GAA TGT CAG AGC AAC CCT TGT GTG AAC AAT GGG CAG TGT GTG 
lie Asn Glu Cys Gin ser Asn Pro Cys Val Asn Asn Gly Gin Cys Val 
495 . 500 505 



1536 



GAT AAA GTC AAT CGT TTC CAG TGC CTG TGT CCT CCT GGT TTC ACT GGG 
Asp Lys Val Asn Arg Phe Gin Cys Leu Cys Pro Pro Gly Phe Thr Gly 
510 515 520 525 



1584 



CCA GTT TGC CAG ATT GAT ATT GAT GAC TGT TCC AGT ACT CCG TGT CTG 
Pro Val Cys Gin lie Asp lie Asp Asp Cys Ser Ser Thr Pro Cys Leu 
530 535 540 



1632 



AAT GGG GCA AAG TGT ATC GAT CAC CCG AAT GGC TAT GAA TGC CAG TGT 
Asn Gly Ala Lys Cys He Asp His Pro Asn Gly Tyr Glu Cys Gin Cys 
545 550 555 



1680 



GCC ACA GGT TTC ACT GGT GTG TTG TGT GAG GAG AAC ATT GAC AAC TGT 
Ala Thr Gly Phe Thr Gly Val Leu Cys Glu Glu Asn He Asp Asn Cys 
560 565 570 



1728 



GAC CCC GAT CCT TGC CAC CAT GGT CAG TGT CAG GAT GGT ATT GAT TCC 1776 
Asp Pro Asp Pro Cys His His Gly Gin Cys Gin Asp Gly He Asp Ser 
575 580 585 

TAC ACC TGC ATC TGC AAT CCC GGG TAC ATG GGC GCC ATC TGC AGT GAC 1824 
Tyr Thr Cys lie Cys Asn Pro Gly Tyr Met Gly Ala lie Cys Ser Asp 
590 595 600 605 



CAG ATT GAT GAA TGT TAC AGC AGC CCT TGC CTG AAC GAT GGT CGC TGC 
Gin He Asp Glu Cys Tyr Ser Ser Pro Cys Leu Asn Asp Gly Arg Cys 
610 615 620 



1872 
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ATT GAC CTG GTC AAT GGC TAC CAG TGC AAC TGC CAG CCA GGC ACG TCA 
lie Asp Leu Val Asn Gly Tyr Gin Cys Asn Cys Gin Pro Gly Thr Ser 
625 630 635 



1920 



GGG GTT AAT TGT GAA ATT AAT TTT GAT GAC TGT GCA AGT AAC CCT TGT 
Gly Val Asn Cys Glu lie Asn Phe Asp Asp Cys Ala Ser Asn Pro Cys 
640 645 ~ 650 



1968 



ATC CAT GGA ATC TGT ATG GAT GGC ATT AAT CGC TAC AGT TGT GTC TGC 
lie His Gly lie Cys Met Asp Gly lie Asn Arg Tyr Ser Cys Val Cys 
655 660 665 



2016 



TCA CCA GGA TTC ACA GGG CAG AGA TGT AAC ATT GAC ATT GAT GAG TGT 
Ser Pro Gly Phe Thr Gly Gin Arg Cys Asn lie Asp lie Asp Glu Cys 
670 675 680 685 



2064 



GCC TCC AAT^CCC TGT CGC AAG GGT GCA ACA TGT ATC AAC GGT GTG AAT 
Ala Ser Asn Pro Cys Arg Lys Gly Ala Thr Cys lie Asn Gly Val Asn 
690 695 700 



2112 



GGT TTC CGC TGT ATA TGC CCC GAG GGA CCC CAT CAC CCC AGC TGC TAC 
Gly Phe Arg Cys lie Cys Pro Glu Gly Pro His His Pro Ser Cys Tyr 
705 710 715 



2160 



TCA CAG GTG AAC GAA TGC CTG AGC AAT CCC TGC ATC CAT GGA AAC TGT 
Ser Gin Val Asn Glu Cys Leu Ser Asn Pro Cys lie His Gly Asn Cys 
720 725 730 



2208 



ACT GGA GGT CTC AGT GGA TAT AAG TGT CTC TGT GAT GCA GGC TGG GTT 
Thr Gly Gly Leu Ser Gly Tyr Lys Cys Leu Cys Asp Ala Gly Trp Val 
735 740 745 



2256 



GGC ATC AAC TGT GAA GTG GAC AAA AAT GAA TGC CTT TCG AAT CCA TGC 
Gly lie Asn Cys Glu Val Asp Lys Asn Glu Cys Leu Ser Asn Pro Cys 
750 755 760 765 



2304 



CAG AAT GGA GGA ACT TGT GAC AAT CTG GTG AAT GGA TAC AGG TGT ACT 
Gin Asn Gly Gly Thr Cys, Asp Asn Leu Val Asn Gly Tyr Arg Cys Thr 
770 775 780 



2352 



TGC AAG AAG GGC TTT AAA GGC TAT AAC TGC CAG GTG AAT ATT GAT GAA 
Cys Lys Lys Gly Phe Lys Gly Tyr Asn Cys Gin Val Asn lie Asp Glu 
785 790 795 



2400 



TGT GCC TCA AAT CCA TGC CTG AAC CAA GGA ACC TGC TTT GAT GAC ATA 
Cys Ala Ser Asn Pro Cys Leu Asn Gin Gly Thr Cys Phe Asp Asp lie 
800 805 810 



2448 



AGT GGC TAC ACT TGC CAC TGT GTG CTG CCA TAC ACA GGC AAG AAT TGT 
Ser Gly Tyr Thr Cys His Cys Val Leu Pro Tyr Thr Gly Lys Asn Cys 
815 820 825 



2496 



CAG ACA GTA TTG GCT CCC TGT TCC CCA AAC CCT TGT GAG AAT GCT GCT 
Gin Thr Val Leu Ala Pro Cys Ser Pro Asn Pro Cys Glu Asn Ala Ala 
830 835 840 845 



2544 



GTT TGC AAA GAG TCA CCA AAT TTT GAG AGT TAT ACT TGC TTG TGT GCT 
Val Cys Lys Glu Ser Pro Asn Phe Glu Ser Tyr Thr Cys Leu Cys Ala 
850 855 860 



2592 



CCT GGC TGG CAA GGT CAG CGG TGT ACC ATT GAC ATT GAC GAG TGT ATC 
Pro Gly Trp Gin Gly Gin Arg Cys Thr lie Asp He Asp Glu Cys He 
865 870 875 



2640 



TCC AAG CCC TGC ATG AAC CAT GGT CTC TGC CAT AAC ACC CAG GGC AGC 
Ser Lys Pro Cys Met Asn His Gly Leu Cys His Asn Thr Gin Gly Ser 
880 885 890 



2688 
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TAC ATG TGT GAA TGT CCA CCA GGC TTC AGT GGT ATG GAC TGT GAG GAG 2736 

Tyr Met Cys Glu Cys Pro Pro Gly Phe Ser Gly Met Asp Cys Glu Glu 

895 900 905 

GAC ATT GAT GAC TGC CTT GCC AAT CCT TGC CAG AAT GGA GGT TCC TGT 2784 
Asp lie Asp Asp Cys Leu Ala Asn Pro Cys Gin Asn Gly Gly Ser Cys 
910 915 920 925 

ATG GAT GGA GTG AAT ACT TTC TCC TGC CTC TGC CTT CCG GGT TTC ACT 2832 
Met Asp Gly Val Asn Thr Phe Ser Cys Leu Cys Leu Pro Gly Phe Thr 
930 935 940 

GGG GAT AAG TGC CAG ACA GAC ATG AAT GAG TGT CTG AGT GAA CCC TGT 2880 
Gly Asp Lys Cys Gin Thr Asp Met Asn Glu Cys Leu Ser Glu Pro Cys 
945 ~ 950 955 

AAG AAT GGA GGG ACC TGC TCT GAC TAC GTC AAC AGT TAC ACT TGC AAG 2928 
Lys Asn Gly Gly Thr Cys Ser Asp Tyr Val Asn Ser Tyr Thr Cys Lys 
960 965 970 

TGC CAG GCA GGA TTT GAT GGA GTC CAT TGT GAG AAC AAC ATC AAT GAG 2976 
Cys Gin Ala Gly Phe Asp Gly Val His Cys Glu Asn Asn He Asn Glu 
975 980 985 

TGC ACT GAG AGC TCC TGT TTC AAT GGT GGC ACA TGT GTT GAT GGG ATT 3024 
Cys Thr Glu Ser ser Cys Phe Asn Gly Gly Thr Cys Val Asp Gly lie 
990 995 1000 1005 

AAC TCC TTC TCT TGC TTG TGC CCT GTG GGT TTC ACT GGA TCC TTC TGC 3072 
Asn Ser Phe Ser Cys Leu Cys Pro Val Gly Phe Thr Gly Ser Phe Cys 
1010 1015 1020 

CTC CAT GAG ATC AAT GAA TGC AGC TCT CAT CCA TGC CTG AAT GAG GGA 3120 
Leu His Glu He Asn Glu Cys Ser Ser His Pro Cys Leu Asn Glu Gly 
1025 1030 1035 

ACG TGT GTT GAT GGC CTG GGT ACC TAC CGC TGC AGC TGC CCC CTG GGC 3168 
Thr Cys Val Asp Gly Leu Gly Thr Tyr Arg Cys Ser Cys Pro Leu Gly 
1040 1045 1050 

TAC ACT GGG AAA AAC TGT CAG ACC CTG GTG AAT CTC TGC AGT CGG TCT 3216 
Tyr Thr Gly Lys Asn Cys Gin Thr Leu Val Asn Leu Cys Ser Arg Ser 
1055 1060 1065 

CCA TGT AAA AAC AAA GGT ACT TGT GTT CAG AAA AAA GCA GAG TCC CAG 3264 
Pro Cys Lys Asn Lys Gly Thr Cys Val Gin Lys Lys Ala Glu Ser Gin 
1070 1075 1080 1085 

TGC CTA TGT CCA TCT GGA TGG GCT GGT GCC TAT TGT GAC GTG CCC AAT 3312 
Cys Leu Cys Pro Ser Gly Trp Ala Gly Ala Tyr Cys Asp Val Pro Asn 
1090 1095 1100 

GTC TCT TGT GAC ATA GCA GCC TCC AGG AGA GGT GTG CTT GTT GAA CAC 3360 
Val Ser Cys Asp He Ala Ala Ser Arg Arg Gly Val Leu Val Glu His 
1105 1110 1115 

TTG TGC CAG CAC TCA GGT GTC TGC ATC AAT GCT GGC AAC ACG CAT TAC 3408 
Leu Cys Gin His Ser Gly Val Cys He Asn Ala Gly Asn Thr His Tyr 
1120 ~ 1125 1130 

TGT CAG TGC CCC CTG GGC TAT ACT GGG AGC TAC TGT GAG GAG CAA CTC 3456 
Cys Gin Cys Pro Leu Gly Tyr Thr Gly S r Tyr Cys Glu Glu Gin Leu 
1135 * 1140 1145 

GAT GAG TGT GCG TCC AAC CCC TGC CAG CAC GGG GCA ACA TGC AGT GAC 3504 
Asp Glu Cys Ala Ser Asn Pro Cys Gin His Gly Ala Thr Cys Ser Asp 
1150 1155 1160 1165 
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TTC ATT GGT GGA TAC AGA TGC GAG TGT GTC CCA GGC TAT CAG GGT GTC 3552 
Phe lie Gly Gly Tyr Arg Cys Glu Cys Val Pro Gly Tyr Gin Gly Val 
1170 1175 1180 

AAC TGT GAG TAT GAA GTG GAT GAG TGC CAG AAT CAG CCC TGC CAG AAT 3600 
Asn Cys Glu Tyr Glu Val Asp Glu Cys Gin Asn Gin Pro Cys Gin Asn 
1185 1190 1195 

GGA GGC ACC TGT ATT GAC CTT GTG AAC CAT TTC AAG TGC TCT TGC CCA 3648 
Gly Gly Thr Cys lie Asp Leu Val Asn His Phe Lys Cys Ser Cys Pro 
1200 1205 1210 

CCA GGC ACT CGG GGC CTA CTC TGT GAA GAG AAC ATT GAT GAC TGT GCC 3696 
Pro Gly Thr Arg Gly Leu Leu Cys Glu Glu Asn lie Asp Asp Cys Ala 
1215 1220 1225 

CGG GGT CCC CAT TGC CTT AAT GGT GGT CAG TGC ATG GAT AGG ATT GGA 3744 
Arg Gly Pro His Cys Leu Asn Gly Gly Gin Cys Met Asp Arg He Gly 
1230 1235 1240 1245 

GGC TAC AGT TGT CGC TGC TTG CCT GGC TTT GCT GGG GAG CGT TGT GAG 3792 
Gly Tyr Ser Cys Arg Cys Leu Pro Gly Phe Ala Gly Glu Arg Cys Glu 
1250 1255 1260 

GGA GAC ATC AAC GAG TGC CTC TCC AAC CCC TGC AGC TCT GAG GGC AGC 3840 
Gly Asp He Asn Glu Cys Leu Ser Asn Pro Cys Ser Ser Glu Gly Ser 
1265 1270 1275 

CTG GAC TGT ATA CAG CTC ACC AAT GAC TAC CTG TGT GTT TGC CGT AGT 3888 
Leu Asp Cys He Gin Leu Thr Asn Asp Tyr Leu Cys Val Cys Arg Ser 
1280 1285 1290 

GCC TTT ACT GGC CGG CAC TGT GAA ACC TTC GTC GAT GTG TGT CCC CAG 3936 
Ala Phe Thr Gly Arg His Cys Glu Thr Phe Val Asp Val Cys Pro Gin 
1295 1300 1305 

ATG CCC TGC CTG AAT GGA GGG ACT TGT GCT GTG GCC AGT AAC ATG CCT 3984 
Met Pro Cys Leu Asn Gly Gly Thr Cys Ala Val Ala Ser Asn Met Pro 
1310 1315 1320 1325 

GAT GGT TTC ATT TGC CGT TGT CCC CCG GGA TTT TCC GGG GCA AGG TGC 4032 
Asp Gly Phe He Cys Arg Cys Pro Pro Gly Phe Ser Gly Ala Arg Cys 
1330 1335 1340 

CAG AGC AGC TGT GGA CAA GTG AAA TGT AGG AAG GGG GAG CAG TGT GTG 4080 
Gin Ser Ser Cys Gly Gin Val Lys Cys Arg Lys Gly Glu Gin Cys Val 
1345 1350 1355 

CAC ACC GCC TCT GGA CCC CGC TGC TTC TGC CCC AGT CCC CGG GAC TGC 4128 
His Thr Ala Ser Gly Pro Arg Cys Phe Cys Pro Ser Pro Arg Asp Cys 
1360 ^ 1365 1370 

GAG TCA GGC TGT GCC AGT AGC CCC TGC CAG CAC GGG GGC AGC TGC CAC 4176 
Glu Ser Gly Cys Ala Ser Ser Pro Cys Gin His Gly Gly Ser Cys His 
1375 1380 1385 

CCT CAG CGC CAG CCT CCT TAT TAC TCC TGC CAG TGT GCC CCA CCA TTC 4224 
Pro Gin Arg Gin Pro Pro Tyr Tyr Ser Cys Gin Cys Ala Pro Pro Phe 
1390 1395 1400 1405 

TCG GGT AGC CGC TGT GAA CTC TAC ACG GCA CCC CCC AGC ACC CCT CCT 4272 
Ser Gly Ser Arg Cys Glu Leu Tyr Thr Ala Pro Pro Ser Thr Pro Pro 
1410 1415 1420 

GCC ACC TGT CTG AGC CAG TAT TGT GCC GAC AAA GCT CGG GAT GGC GTC 4320 
Ala Thr Cys Leu Ser Gin Tyr Cys Ala Asp Lys Ala Arg Asp Gly Val 
1425 1430 1435 
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TGT GAT GAG GCC TGC AAC AGC CAT GCC TGC CAG TGG GAT GGG GGT GAC 4368 
Cys Asp Glu Ala Cys Asn Ser His Ala Cys Gin Trp Asp Gly Gly Asp 
1440 1445 1450 

TGT TCT CTC ACC ATG GAG AAC CCC TGG GCC AAC TGC TCC TCC CCA CTT 4416 
Cys Ser Leu Thr Met Glu Asn Pro Trp Ala Asn Cys Ser Ser Pro Leu 
1455 1460 1465 

CCC TGC TGG GAT TAT ATC AAC AAC CAG TGT GAT GAG CTG TGC AAC ACG 4464 
Pro Cys Trp Asp Tyr lie Asn Asn Gin Cys Asp Glu Leu Cys Asn Thr 
1470 1475 1480 1485 

GTC GAG TGC CTG TTT GAC AAC TTT GAA TGC CAG GGG AAC AGC AAG ACA 4512 
Val Glu Cys Leu Phe Asp Asn Phe Glu Cys Gin Gly Asn Ser Lys Thr 
1490 1495 1500 

TGC AAG TAT GAC AAA TAC TGT GCA GAC CAC TTC AAA GAC AAC CAC TGT 4560 
Cys Lys Tyr Asp Lys Tyr Cys Ala Asp His Phe Lys Asp Asn His Cys 
1505 1510 1515 

AAC CAG GGG TGC AAC AGT GAG GAG TGT GGT TGG GAT GGG CTG GAC TGT 4608 
Asn Gin Gly Cys Asn Ser Glu Glu Cys Gly Trp Asp Gly Leu Asp Cys 
1520 1525 1530 

GCT GCT GAC CAA CCT GAG AAC CTG GCA GAA GGT ACC CTG GTT ATT GTG 4656 
Ala Ala Asp Gin Pro Glu Asn Leu Ala Glu Gly Thr Leu Val lie Val 
1535 1540 1545 

GTA TTG ATG CCA CCT GAA CAA CTG CTC CAG GAT GCT CGC AGC TTC TTG 4704 
Val Leu Met Pro Pro Glu Gin Leu Leu Gin Asp Ala Arg Ser Phe Leu 
1550 1555 1560 1565 

CGG GCA CTG GGT ACC CTG CTC CAC ACC AAC CTG CGC ATT AAG CGG GAC 4752 
Arg Ala Leu Gly Thr Leu Leu His Thr Asn Leu Arg lie Lys Arg Asp 
1570 1575 1580 

TCC CAG GGG GAA CTC ATG GTG TAC CCC TAT TAT GGT GAG AAG TCA GCT 4800 
Ser Gin Gly Glu Leu Met Val Tyr Pro Tyr Tyr Gly Glu Lys Ser Ala 
1585 1590 1595 

GCT ATG AAG AAA CAG AGG ATG ACA CGC AGA TCC CTT CCT GGT GAA CAA 4848 
Ala Met Lys Lys Gin Arg Met Thr Arg Arg Ser Leu Pro Gly Glu Gin 
1600 1605 1610 

GAA CAG GAG GTG GCT GGC TCT AAA GTC TTT CTG GAA ATT GAC AAC CGC 4896 
Glu Gin Glu Val Ala Gly Ser Lys Val Phe Leu Glu He Asp Asn Arg 
1615 1620 1625 

CAG TGT GTT CAA GAC TCA GAC CAC TGC TTC AAG AAC ACG GAT GCA GCA 4944 
Gin Cys Val Gin Asp Ser Asp His Cys Phe Lys Asn Thr Asp Ala Ala 
1630 1635 1640 1645 

GCA GCT CTC CTG GCC TCT CAC GCC ATA CAG GGG ACC CTG TCA TAC CCT 4992 
Ala Ala Leu Leu Ala Ser His Ala He Gin Gly Thr Leu Ser Tyr Pro 
1650 1655 1660 

CTT GTG TCT GTC GTC AGT GAA TCC CTG ACT CCA GAA CGC ACT CAG CTC 5040 
Leu Val Ser Val Val Ser Glu Ser Leu Thr Pro Glu Arg Thr Gin Leu 
1665 1670 1675 

CTC TAT CTC CTT GCT GTT GCT GTT GTC ATC ATT CTG TTT ATT ATT CTG 5088 
Leu Tyr Leu Leu Ala Val Ala Val Val He He Leu Phe He He Leu 
1680 1685 1690 

CTG GGG GTA ATC ATG GCA AAA CGA AAG CGT AAG CAT GGC TCT CTC TGG 5136 
Leu Gly Val He Met Ala Lys Arg Lys Arg Lys His Gly Ser Leu Trp 
1695 1700 1705 
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CTG CCT GAA GGT TTC ACT CTT CGC CGA GAT GCA AGC AAT CAC AAG CGT 5184 
Leu Pro Glu Gly Phe Thr Leu Arg Arg Asp Ala Ser Asn His Lys Arg 
1710 1715 " 1720 1725 

CGT GAG CCA GTG GGA CAG GAT GCT GTG GGG CTG AAA AAT CTC TCA GTG 5232 
Arg Glu Pro Val Gly Gin Asp Ala Val Gly Leu Lys Asn Leu Ser Val 
1730 1735 1740 

CAA GTC TCA GAA GCT AAC CTA ATT GGT ACT GGA ACA AGT GAA CAC TGG 5280 
Gin Val Ser Glu Ala Asn Leu lie Gly Thr Gly Thr Ser Glu His Trp 
1745 1750 1755 

GTC GAT GAT GAA GGG CCC CAG CCA AAG AAA GTA AAG GCT GAA GAT GAG 5328 
Val Asp Asp Glu Gly Pro Gin Pro Lys Lys Val Lys Ala Glu Asp Glu 
1760 1765 1770 

GCC TTA CTC TCA GAA GAA GAT GAC CCC ATT GAT CGA CGG CCA TGG ACA 5376 
Ala Leu Leu Ser Glu Glu Asp Asp Pro lie Asp Arg Arg Pro Trp Thr 
1775 1780 1785 

CAG CAG CAC CTT GAA GCT GCA GAC ATC CGT AGG ACA CCA TCG CTG GCT 5424 
Gin Gin His Leu Glu Ala Ala Asp lie Arg Arg Thr Pro Ser Leu Ala 
1790 1795 1800 1805 

CTC ACC CCT CCT CAG GCA GAG CAG GAG GTG GAT GTG TTA GAT GTG AAT 5472 
Leu Thr Pro Pro Gin Ala Glu Gin Glu Val Asp Val Leu Asp Val Asn 
1810 1815 1820 

GTC CGT GGC CCA GAT GGC TGC ACC CCA TTG ATG TTG GCT TCT CTC CGA 5520 
Val Arg Gly Pro Asp Gly Cys Thr Pro Leu Met Leu Ala Ser Leu Arg 
1825 1830 1835 

GGA GGC AGC TCA GAT TTG AGT GAT GAA GAT GAA GAT GCA GAG GAC TCT 5568 
Gly Gly Ser Ser Asp Leu Ser Asp Glu Asp Glu Asp Ala Glu Asp Ser 
1840 1845 1850 

TCT GCT AAC ATC ATC ACA GAC TTG GTC TAC CAG GGT GCC AGC CTC CAG 5616 
Ser Ala Asn lie He Thr Asp Leu Val Tyr Gin Gly Ala Ser Leu Gin 
1855 i860 1865 

GCC CAG ACA GAC CGG ACT GGT GAG ATG GCC CTG CAC CTT GCA GCC CGC 5664 
Ala Gin Thr Asp Arg Thr Gly Glu Met Ala Leu His Leu Ala Ala Arg 
1870 1875 1880 1885 

TAC TCA CGG GCT GAT GCT GCC AAG CGT CTC CTG GAT GCA GGT GCA GAT 5712 
Tyr Ser Arg Ala Asp Ala Ala Lys Arg Leu Leu Asp Ala Gly Ala Asp 
1890 1895 1900 

GCC AAT GCC CAG GAC AAC ATG GGC CGC TGT CCA CTC CAT GCT GCA GTG 5760 
Ala Asn Ala Gin Asp Asn Met Gly Arg Cys Pro Leu His Ala Ala Val 
1905 1910 1915 

GCA GCT GAT GCC CAA GGT GTC TTC CAG ATT CTG ATT CGC AAC CGA GTA 5808 
Ala Ala Asp Ala Gin Gly Val Phe Gin lie Leu lie Arg Asn Arg Val 
1920 1925 1930 

ACT GAT CTA GAT GCC AGG ATG AAT GAT GGT ACT ACA CCC CTG ATC CTG 5856 
Thr Asp Leu Asp Ala Arg Met Asn Asp Gly Thr Thr Pro Leu lie Leu 
1935 1940 1945 

GCT GCC CGC CTG GCT GTG GAG GGA ATG GTG GCA GAA CTG ATC AAC TGC 5904 
Ala Ala Arg Leu Ala Val Glu Gly Met Val Ala Glu Leu He Asn Cys 
1950 1955 1960 1965 

CAA GCG GAT GTG AAT GCA GTG GAT GAC CAT GGA AAA TCT GCT CTT CAC 5952 
Gin Ala Asp Val Asn Ala Val Asp Asp His Gly Lys Ser Ala Leu His 
1970 1975 1980 
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TGG GCA GCT GCT GTC AAT AAT GTG GAG GCA ACT CTT TTG TTG TTG AAA 6000 
Trp Ala Ala Ala Val Asn Asn Val Glu Ala Thr Leu Leu Leu Leu Lys 
1985 1990 1995 

AAT GGG GCC AAC CGA GAC ATG CAG GAC AAC AAG GAA GAG ACA OCT CTG 6048 
Asn Gly Ala Asn Arg Asp Met Gin Asp Asn Lys Glu Glu Thr Pro Leu 
2000 2005 2010 

TTT CTT GCT GCC CGG GAG GGG A6C TAT GAA GCA GCC AAG ATC CTG TTA 6096 
Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Ala Ala Lys lie Leu Leu 
2015 2020 2025 

GAC CAT TTT GCC AAT CGA GAC ATC ACA GAC CAT ATG GAT CGT CTT CCC 6144 
Asp His Phe Ala Asn Arg Asp lie Thr Asp His Met Asp Arg Leu Pro 
2030 2035 2040 2045 

CGG GAT GTG GCT CGG GAT CGC ATG CAC CAT GAC ATT GTG CGC CTT CTG 6192 
Arg Asp Val Ala Arg Asp Arg Met His His Asp lie Val Arg Leu Leu 
2050 2055 2060 

GAT GAA TAC AAT GTG ACC CCA AGC CCT CCA GGC ACC GTG TTG ACT TCT 6240 
Asp Glu Tyr Asn Val Thr Pro Ser Pro Pro Gly Thr Val Leu Thr Ser 
2065 2070 2075 

GCT CTC TCA CCT GTC ATC TGT GGG CCC AAC AG A TCT TTC CTC AGC CTG 6288 
Ala Leu Ser Pro Val lie Cys Gly Pro Asn Arg Ser Phe Leu Ser Leu 
2080 2085 2090 

AAG CAC ACC CCA ATG GGC AAG AAG TCT AGA CGG CCC AGT GCC AAG AGT 6336 
Lys His Thr Pro Met Gly Lys Lys Ser Arg Arg Pro Ser Ala Lys Ser 
2095 2100 2105 

ACC ATG CCT ACT AGC CTC CCT AAC CTT GCC AAG GAG GCA AAG GAT GCC 6384 
Thr Met Pro Thr Ser Leu Pro Asn Leu Ala Lys Glu Ala Lys Asp Ala 
2110 2115 2120 2125 

AAG GGT AGT AGG AGG AAG AAG TCT CTG AGT GAG AAG GTC CAA CTG TCT 6432 
Lys Gly Ser Arg Arg Lys Lys Ser Leu Ser Glu Lys Val Gin Leu Ser 
2130 2135 2140 

GAG AGT TCA GTA ACT TTA TCC CCT GTT GAT TCC CTA GAA TCT CCT CAC 6480 
Glu Ser Ser Val Thr Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His 
2145 2150 2155 

ACG TAT GTT TCC GAC ACC ACA TCC TCT CCA ATG ATT ACA TCC CCT GGG 6528 
Thr Tyr Val Ser Asp Thr Thr Ser Ser Pro Met lie Thr Ser Pro Gly 
2160 2165 ' 2170 

ATC TTA CAG GCC TCA CCC AAC CCT ATG TTG GCC ACT GCC GCC CCT CCT 6576 
lie Leu Gin Ala Ser Pro Asn Pro Met Leu Ala Thr Ala Ala Pro Pro 
2175 2180 2185 

GCC CCA GTC CAT GCC CAG CAT GCA CTA TCT TTT TCT AAC CTT CAT GAA 6624 
Ala Pro Val His Ala Gin His Ala Leu Ser Phe Ser Asn Leu His Glu 
2190 2195 2200 2205 

ATG CAG CCT TTG GCA CAT GGG GCC AGC ACT GTG CTT CCC TCA GTG AGC 6672 
Met Gin Pro Leu Ala His Gly Ala Ser Thr Val Leu Pro Ser Val Ser 
2210 * 2215 2220 

CAG TTG CTA TCC CAC CAC CAC ATT GTG TCT CCA GGC AGT GGC AGT GCT 6720 
Gin Leu Leu Ser His His His He Val Ser Pro Gly Ser Gly Ser Ala 
2225 2230 2235 

GGA AGC TTG AGT AGG CTC CAT CCA GTC CCA GTC CCA GCA GAT TGG ATG 6768 
Gly Ser Leu Ser Arg Leu His Pro Vai Pro Val Pro Ala Asp Trp Met 
2240 2245 2250 
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AAC CGC ATG GAG GTG AAT GAG ACC CAG TAC AAT GAG ATG TTT GGT ATG 6816 
Asn Arg Met Glu Val Asn Glu Thr Gin Tyr Asn Glu Met Phe Gly Met 
2255 2260 2265 

GTC CTG GCT CCA GCT GAG GGC ACC CAT CCT GGC ATA GCT CCC CAG AGC 6864 
Val Leu Ala Pro Ala Glu Gly Thr His Pro Gly lie Ala Pro Gin Ser 
2270 2275 2280 2285 

AGG CCA CCT GAA GGG AAG CAC ATA ACC ACC CCT CGG GAG CCC TTG CCC 6912 
Arg Pro Pro Glu Gly Lys His lie Thr Thr Pro Arg Glu Pro Leu Pro 
2290 2295 2300 

CCC ATT GTG ACT TTC CAG CTC ATC CCT AAA GGC AGT ATT GCC CAA CCA 6960 
Pro lie Val Thr Phe Gin Leu lie Pro Lys Gly Ser lie Ala Gin Pro 
2305- 2310 2315 

GCG GGG GCT CCC CAG CCT CAG TCC ACC TGC CCT CCA GCT GTT GCG GGC 7008 
Ala Gly Ala Pro Gin Pro Gin Ser Thr Cys Pro Pro Ala Val Ala Gly 
2320 2325 2330 

CCC CTG CCC ACC ATG TAC CAG ATT CCA GAA ATG GCC CGT TTG CCC AGT 7056 
Pro Leu Pro Thr Met Tyr Gin lie Pro Glu Met Ala Arg Leu Pro Ser 
2335 2340 2345 

GTG GCT TTC CCC ACT GCC ATG ATG CCC CAG CAG GAC GGG CAG GTA GCT 7104 
Val Ala Phe Pro Thr Ala Met Met Pro Gin Gin Asp Gly Gin Val Ala 
2350 2355 2360 2365 

CAG ACC ATT CTC CCA GCC TAT CAT CCT TTC CCA GCC TCT GTG GGC AAG 7152 
Gin Thr lie Leu Pro Ala Tyr His Pro Phe Pro Ala Ser Val Gly Lys 
2370 2375 2380 

TAC CCC ACA CCC CCT TCA CAG CAC AGT TAT GCT TCC TCA AAT GCT GCT 7200 
Tyr Pro Thr Pro Pro Ser Gin His Ser Tyr Ala Ser Ser Asn Ala Ala 
2385 2390 2395 

GAG CGA ACA CCC AGT CAC AGT GGT CAC CTC CAG GGT GAG CAT CCC TAC 7248 
Glu Arg Thr Pro Ser His Ser Gly His Leu Gin Gly Glu His Pro Tyr 
2400 2405 2410 

CTG ACA CCA TCC CCA GAG TCT CCT GAC CAG TGG TCA AGT TCA TCA CCC 7296 
Leu Thr Pro Ser Pro Glu Ser Pro Asp Gin Trp Ser Ser Ser Ser Pro 
2415 2420 2425 

CAC TCT GCT TCT GAC TGG TCA CAT GTG ACC ACC AGC CCT ACC CCT GGG 7344 
His Ser Ala Ser Asp Trp Ser Asp Val Thr Thr Ser Pro Thr Pro Gly 
2430 2435 2440 2445 

GGT GCT GGA GGA GGT CAG CGG GGA CCT GGG ACA CAC ATG TCT GAG CCA 7392 
Gly Ala Gly Gly Gly Gin Arg Gly Pro Gly Thr His Met Ser Glu Pro 
2450 2455 2460 

CCA CAC AAC AAC ATG CAG GTT TAT GCG TGAGAGAGTC CACCTCCAGT 7439 
Pro His Asn Asn Met Gin Val Tyr Ala 
2465 2470 

GTAGAGACAT AACTGACTTT TGTAAATGCT GCTGAGGAAC AAATGAAGGT CATCCGGGAG 7499 

AGAAATGAAG AAATCTCTGG AGCCAGCTTC TAGAGGTAGG AAAGAGAAGA TGTTCTTATT . 7559 

CAGATAATGC AAG AG AAG CA ATTCGTCAGT TTCACTGGGT ATCTGCAAGG CTTATTGATT 7619 

ATTCTAATCT AATAAGACAA GTTTGTGGAA ATG C AAG ATG AATACAAGCC TTGGGTCCAT 7679 

GTTTACTCTC TTCTATTTGG AGAATAAGAT GGATG CTTAT TGAAGCCCAG ACATTCTTGC 7739 



AGCTTGGACT GCATTTTAAG CCCTGCAGGC TTCTGCCATA TCCATGAGAA GATTCTACAC 



7799 
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TAGCGTCCTG 


TTGGGAATTA 


TGCCCTGGAA 


TTCTGCCTGA 


ATTGACCTAC 


GCATCTCCTC 


7859 


CTCCTTGGAC 


ATTCTTTTGT 


CTTCATTTGG 


TGCTTTTGGT 


TTTGCACCTC 


TCCGTGATTG 


7919 


TAGCCCTACC 


AG C ATGTT AT 


AGGGCAAGAC 


CTTTGTGCTT 


TTGATCATTC 


TGGCCCATGA 


7979 


AAGCAACTTT 


GGTCTCCTTT 


CCCCTCCTGT 


CTTCCCGGTA 


TCCCTTGGAG 


TCTCACAAGG 


8039 


TTTACTTTGG 


TATGGTTCTC 


AGGACAAACC 


TTTCAAGTAT 


GTTGTTTCTT 


TGGAAAATGG 


8099 


ACATACTGTA 


TTGTGTTCTC 


CTGCATATAT 


C ATTCCTGG A 


GAGAGAAGGG 


GAGAAGAATA 


8159 


CTTTTCTTCA 


ACAAATTTTG 


GGGGCAGGAG 


ATCCCTTCAA 


G AGG CTGCAC 


CTTAATTTTT 


fi*D 1 Q 


CTTGTCTGTG 


TGCAGGTCTT 


CATATAAACT 


TTACCAGGAA 


GAAGGGTGTG 


AGTTTG TTG T 


O^C /7 


TTTTCTGTGT 


ATGGGCCTGG 


TCAGTGTAAA 


GTTTTATCCT 


TGATAGTCTA 


GTTACTATGA 


CJJ7 


CCCTCCCCAC 


TTTTTTAAAA 


CCAGAAAAAG 


GTTTGGAATG 


TTGGAATGAC 


CAAuAVjALAA 




GTTAACTCGT 


GCAAGAGCCA 


GTTACCCACC 


CACAGGTCCC 


CCTACTTCCT 


GCCAAGCATT 


O A C Q 


CCATTGACTG 


CCTGTATGGA ACACATTTGT 


CCCAGATCTG AGCATTCTAG 


GCCTGTTTCA 


8519 


CTCACTCACC 


CAGCATATGA 


AACTAGTCTT 


AACTGTTGAG CCTTTCCTTT 


CATATCCACA 


8579 


GAAGACACTG 


TCTCAAATGT 


TGTACCCTTG 


CCATTTAGGA CTGAACTTTC 


CTTAGCCCAA 


8639 


GGGACCCAGT 


GACAGTTGTC 


TTCCGTTTGT 


CAGATGATCA GTCTCTACTG 


ATTATCTTGC 


8699 


TGCTTAAAGG 


CCTGCTCACC 


AATCTTTCTT 


TCACACCGTG 


TGGTCCGTGT 


TACTGGTATA 


8759 


CCCAGTATGT 


TCTCACTGAA 


GAGATGGACT 


TTATATGTTC 


AAGTGCAGGA 


ATTGGAAAGT 


8819 


TGGACTTGTT TTCTATGATC CAAAACAGCC 


CTATAAGAAG 


GTTGGAAAAG 


GAGGAACTAT 


8879 


ATAGCAGCCT 


TTG CTATTTT 


CTGCTACCAT 


TTCTTTTCCT 


CTGAAGCGGC 


CATGACATTC 


8939 


CCTTTGGCAA 


CTAACGTAGA 


AACTCAAC AG 


AACATTTTCC 


TTTCCTAGAG 


TGACCTTTTA 


8999 


GATGATAATG 


GACAACTATA 


GACTTGCTCA 


TTGTTCAGAC 


TGATTGCCCC 


TCACCTGAAT 


9059 


CCACTCTCTG 


TATTCATGCT 


CTTGGCAATT 


TCTTTGACTT 


TCTTTTAAGG 


GC AG AAG CAT 


9119 


TTTAGTTAAT 


TGTAGATAAA 


GAATAGTTTT 


CTTCCTCTTC 


TCCTTGGGCC 


AGTTAATAAT 


9179 


TGGTCCATGG 


CTACACTGCA ACTTCCGTCC 


AGTGCTGTGA 


TG CCCATG AC 


ACCTG C AAAA 


9239 


TAAGTTCTGC 


CTGGGCATTT 


TGTAGATATT 


AACAGGTGAA 


TTCCCGACTC 


TTTTGGTTTG 


9299 


AATGACAGTT 


CTCATTCCTT 


CTATGGCTGC 


AAGTATGCAT 


CAGTG CTTCC 


CACTTACCTG 




ATTTGTCTGT 


CGGTGGCCCC 


ATATGGAAAC 


CCTGCGTGTC 


TGTTGGCATA 


ATAGTTTACA 


9419 


AATGGTTTTT 


TCAGTCCTAT 


CCAAATTTAT 


TGAACCAACA 


AAAATAATTA 


CTTCTGCCCT 


9479 


GAGATAAGCA 


GATTAAGTTT 


GTTCATTCTC 


TGCTTTATTC 


TCTCCATGTG 


GCAACATTCT 


9539 


GTCAGCCTCT 


TTCATAGTGT 


GCAAACATTT 


TATCATTCTA 


AATGGTGACT 


CTCTGCCCTT . 


9599 


GGACCCATTT 


ATTATTCACA 


GATGGGGAGA 


ACCTATCTGC 


ATGGACCCTC 


ACCATCCTCT 


9659 


GTGCAGCACA 


CACAGTGCAG 


GGAGCCAGTG 


GCGATGGCGA 


TGACTTTCTT 


CCCCTGGGAA 


9719 


TTCC 












9723 
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WHAT IS CLAIMED IS : 

1. A pharmaceutical composition comprising a therapeutically 
effective amount of a Notch protein; and a pharmaceutical ly acceptable carrier. 

2. The composition of claim 1 in which the Notch protein is a 
human Notch protein. 

3. A pharmaceutical composition comprising a therapeutically 
effective amount of a protein, said protein comprising an amino acid sequence 
encoded by the DNA sequence depicted in Figure 8A (SEQ ID NO:5), 8B (SEQ 
ID NO:6), 8C (SEQ ID NO:7), 9A (SEQ ID NO:8), or 9B (SEQ ID NO:9) ? 
which is able to be bound by an antibody to a Notch protein; and a 
pharmaceutical^ acceptable carrier. 

4. A pharmaceutical composition comprising a therapeutically 
effective amount of a protein, said protein comprising a Notch amino acid 
sequence depicted in Figure 8A (SEQ ID NO:5), 8B (SEQ ID NO:6), 8C (SEQ 
ID NO:7), 9A (SEQ ID NO:8), or 9B (SEQ ID NO:9), which displays one or 
more functional activities associated with a full-length Notch protein; and a 
pharmaceutical ly acceptable carrier. 

5. A pharmaceutical composition comprising a therapeutically 
effective amount of a protein, said protein comprising a fragment of a human 
Notch protein consisting essentially of the extracellular domain of the protein; and 
a pharmaceutical ly acceptable carrier. 

6. A pharmaceutical composition comprising a therapeutically 
effective amount of a protein, said protein comprising a region of a Notch protein 
containing the EGF homologous repeats of the protein; and a pharmaceutical ly 
acceptable carrier. 
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7. A pharmaceutical composition comprising a therapeutically 
effective amount of a fragment of a Notch protein lacking a portion of the EGF- 
homologous repeats of the protein, which fragment is able to be bound by an 
antibody to a Notch protein; and a pharmaceutically acceptable carrier. 

5 

8. A pharmaceutical composition comprising a therapeutically 
effective amount of a protein, said protein comprising a functionally active 
portion of a Notch protein; and a pharmaceutically acceptable carrier. 

10 9. The composition of claim 8 in which the Notch protein is a 

human Notch protein. 

10. A pharmaceutical composition comprising a therapeutically 
effective amount of a chimeric protein, said chimeric protein comprising a 

15 functionally active portion of a human Notch protein joined via a peptide bond to 
a sequence of a protein different from the Notch protein; and a pharmaceutically 
acceptable carrier. 

1 1 . The composition of claim 10 in which the functionally active 
20 portion of the Notch protein is encoded by the human cDNA sequence contained 

in plasmid hN3k as deposited with the ATCC and assigned accession number 
68609, or encoded by the human cDNA sequence contained in plasmid hN5k as 
deposited with the ATCC and assigned accession number 68611. 

25 12. A pharmaceutical composition comprising a therapeutically 

effective amount of a protein, said protein comprising the amino acid sequence 
depicted in Figure 10 (SEQ ID NO: 11); and a pharmaceutically acceptable 
carrier. 

30 13. A pharmaceutical composition comprising a therapeutically 

effective amount of a protein, said protein comprising the amino acid sequence 
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depicted in Figure 11 (SEQ ID NO: 13); and a pharmaceutical^ acceptable 
carrier. 

14. A pharmaceutical composition comprising a therapeutically 
effective amount of a protein, said protein comprising the portion of a human 
Notch protein with the greatest homology to the epidermal growth factor-like 
repeats 11 and 12 of the Drosophila Notch sequence as shown in Figure 4 (SEQ 
ID NO: 14); and a pharmaceutical^ acceptable carrier. 

15. A pharmaceutical composition comprising a therapeutically 
effective amount of a derivative or analog of a Notch protein, which derivative or 
analog is characterized by the ability in vitro, when expressed on the surface of a 
first cell, to bind to a Delta protein expressed on the surface of a second cell; and 
a pharmaceutical^ acceptable carrier. 

16. A pharmaceutical composition comprising a therapeutically 
effective amount of a chimeric protein, said chimeric protein comprising a Notch 
protein joined via a peptide bond to a protein sequence of a protein different from 
the Notch protein; and a pharmaceutical^ acceptable carrier. 

17. A pharmaceutical composition comprising a therapeutically 
effective amount of a fragment of a Notch protein, which fragment is 
characterized by the ability in vitro, when expressed on the surface of a first cell, 
to bind to a Delta protein expressed on the surface of a second cell; and a 
pharmaceutical ly acceptable carrier. 

18. A pharmaceutical composition comprising a therapeutically 
effective amount of a chimeric protein, said chimeric protein comprising a 
fragment of a Notch protein joined via a peptide bond to a protein sequence of a 
protein different from the Notch protein, which fragment is characterized by the 
ability in vitro, when expressed on the surface of a first cell, to bind to a Delta 
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protein expressed on the surface of a second cell; and a pharmaceutical ly 
acceptable carrier. 

19. A pharmaceutical composition comprising a therapeutically 
5 effective amount of a protein, said protein comprising a derivative or analog of a 
Delta protein, which derivative or analog is characterized by the ability in vitro, 
when expressed on the surface of a first cell, to bind to a Notch protein expressed 
on the surface of a second cell; and a pharmaceutical^ acceptable carrier. 

10 20. A pharmaceutical composition comprising a therapeutically 

effective amount of a chimeric protein, said chimeric protein comprising a 
fragment of a Delta protein joined via a peptide bond to a protein sequence of a 
protein different from the Delta protein, which fragment is characterized by the 
ability in vitro, when expressed on the surface of a first cell, to bind to a Notch 

15 protein expressed on the surface of a second cell; and a pharmaceutical ly 
acceptable carrier. 

21 . A pharmaceutical composition comprising a therapeutically 
effective amount of a protein, said protein comprising a derivative or analog of a 

20 Serrate protein, which derivative or analog is characterized by the ability in vitro, 
when expressed on the surface of a first cell, to bind to a Notch protein expressed 
on the surface of a second cell; and a pharmaceutically acceptable carrier. 

22. A pharmaceutical composition comprising a therapeutically 
25 effective amount of a derivative or analog of a Notch protein, which derivative or 

analog is characterized by the ability in vitro, when expressed on the surface of a 
first cell, to bind to a second protein expressed on the surface of a second cell, 
which second protein is selected from the group consisting of a Delta protein and 
a Serrate protein; and a pharmaceutically acceptable carrier. 

30 
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23. A pharmaceutical composition comprising a therapeutically 
effective amount of a nucleic acid encoding a Notch protein; and a 
pharmaceutically acceptable carrier. 

5 24. A pharmaceutical composition comprising a therapeutically 

effective amount of a nucleic acid encoding a functionally active portion of a 
human Notch protein; and a pharmaceutically acceptable carrier. 

25. A pharmaceutical composition comprising a therapeutically 
10 effective amount of a nucleic acid encoding the amino acid sequence depicted in 

Figure 10 (SEQ ID NOrll); and a pharmaceutical^ acceptable carrier. 

26. A pharmaceutical composition comprising a therapeutically 
effective amount of a nucleic acid encoding the amino acid sequence depicted in 

15 Figure 11 (SEQ ID NO: 13); and a pharmaceutically acceptable carrier. 

27. A pharmaceutical composition comprising a therapeutically 
effective amount of a nucleic acid encoding a fragment of a Notch protein, which 
fragment is characterized by the ability in vitro, when expressed on the surface of 

20 a first cell, to bind to a Delta protein expressed on the surface of a second cell; 
and a pharmaceutically acceptable carrier. 

28. A pharmaceutical composition comprising a therapeutically 
effective amount of a nucleic acid encoding a chimeric protein, said chimeric 

25 protein comprising a functionally active fragment of a human Notch protein joined 
via a peptide bond to a protein sequence of a protein different from the Notch 
protein; and a pharmaceutically acceptable carrier. 

29. A pharmaceutical composition comprising a therapeutically 
30 effective amount of a nucleic acid encoding a fragment of a Delta protein, which 

fragment is characterized by the ability in vitro , when expressed on the surface of 
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a first cell, to bind to a Notch protein expressed on the surface of a second cell; 
and a pharmaceutical ly acceptable carrier. 

: 30. A pharmaceutical composition comprising a therapeutically 

5 effective amount of a nucleic acid encoding a fragment of a Serrate protein, which 
fragment is characterized by the ability in vitro, when expressed on the surface of 
a first cell, to bind to a Notch protein expressed on the surface of a second cell; 
and a pharmaceutically acceptable carrier. 

10 31. The composition of claim 24 in which the nucleic acid is a 

nucleic acid vector. 

32. A pharmaceutical composition comprising a therapeutically 
effective amount of an antibody which binds to a Notch protein; and a 

15 pharmaceutically acceptable carrier. 

33. A pharmaceutical composition comprising a therapeutically 
effective amount of a fragment or derivative of an antibody to a Notch protein 
containing the idiotype of the antibody; and a pharmaceutically acceptable carrier. 

20 

34. A method of treating or preventing a disease or disorder in a 
subject comprising administering to a subject in need of such treatment or 
prevention a therapeutically effective amount of a molecule which antagonizes the 
function of a Notch protein. 

25 

35. The method according to claim 34 in which the disease or 
disorder is a malignancy characterized by increased Notch activity or increased 
expression of a Notch protein or of a Notch derivative capable of being bound by 
an anti-Notch antibody, relative to said Notch activity or expression in an 

30 analogous non-malignant sample. 
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36. The method according to claim 34 in which the disease or 
disorder is cervical cancer. 

37. The method according to claim 34 in which the disease or 
5 disorder is breast cancer. 

38. The method according to claim 34 in which the disease or 
disorder is colon cancer. 

10 39. The method according to claim 35 in which the malignancy is 

selected from the group consisting of melanoma, seminoma, and lung cancer. 

40. The method according to claim 35 in which the subject is a 

human. 

15 

41. The method according to claim 36, 37 or 38 in which the 
molecule is an antibody to Notch or a portion of said antibody containing the 
binding domain thereof. 

20 42. The method according to claim 36, 37 or 38 in which the 

molecule is a protein consisting of at least the extracellular domain of a Notch 
protein or a portion thereof capable of binding to a Notch. ligand. 

43. The method according to claim 36, 37 or 38 in which the 
25 molecule is a protein consisting of at least the EGF homologous repeats of a 

Notch protein. 

44. The method according to claim 36, 37 or 38 in which the 
molecule is a protein consisting of at least an adhesive fragment ata Notch 

30 protein. 
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45. The method according to claim 36, 37 or 38 in which the 
molecule is an oligonucleotide which (a) consists of at least six nucleotides; (b) 
comprises a sequence complementary to at least a portion of an RNA transcript of 
a Notch gene; and (c) is hybridizable to the RNA transcript. 

46. A method of treating or preventing a disease or disorder in a 
subject in need of such treatment or prevention comprising administering to the 
subject a therapeutically effective amount of a molecule which promotes the 
function of a Notch protein. 

47. A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of a Notch protein. 

48. A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of a functionally active portion of a Notch protein. 

49. The method according to claim 47 in which the Notch protein 
is a human Notch protein. 

50. A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of a chimeric protein, said protein comprising a functionally 
active portion of a Notch protein joined via a peptide bond to a protein sequence 
of a protein different from the Notch protein. 

51. The method according to claim 49 in which the human Notch 
protein comprises the amino acid sequence depicted in Figure 10 (SEQ ID 

NO: 11) or Figure 11 (SEQ ID NO: 13). 
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52. A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of a derivative or analog of a Notch protein, which derivative or 
analog is characterized by the ability in vitro , when expressed on the surface of a 

5 first cell, to bind to a second protein expressed on the surface of a second cell, 
which second protein is selected from the group consisting of a Delta protein and 
a Serrate protein. 

53. A method of treating or preventing a malignancy in a subject 
10 comprising administering to a subject in need of such treatment or prevention an 

effective amount of a derivative or analog of a Delta protein, which derivative or 
analog is characterized by the ability in vitro, when expressed on the surface of a 
first cell, to bind to a Notch protein expressed on the surface of a second cell. 

15 54. a method of treating or preventing a malignancy in a subject 

comprising administering to a subject in need of such treatment or prevention an 
effective amount of a derivative or analog of a Serrate protein, which derivative 
or analog is characterized by the ability in vitro 9 when expressed on the surface of 
a first cell, to bind to a Notch protein expressed on the surface of a second cell. 

20 

55. A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of a nucleic acid encoding a Notch protein. 

25 56. a method of treating or preventing a malignancy in a subject 

comprising administering to a subject in need of such treatment or prevention an 
effective amount of a nucleic acid encoding a functionally active portion of a 
Notch protein. 

30 57. Th e method according to claim 55 in which the subject is 

human and the Notch protein is a human Notch protein. 
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58. A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of a nucleic acid encoding a fragment of a Notch protein, which 
fragment is characterized by the ability in vitro, when expressed on the surface of 
a first cell, to bind to a second protein expressed on the surface of a second cell, 
which second protein is selected from the group consisting of a Delta protein and 
a Serrate protein. 

59. A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of a nucleic acid encoding a fragment of a Delta protein, which 
fragment is characterized by the ability in vitro, when expressed on the surface of 
a first cell, to bind to a Notch protein expressed on the surface of a second cell. 

60. A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of a nucleic acid encoding a fragment of a Serrate protein, which 
fragment is characterized by the ability in vitro, when expressed on the surface of 
a first cell, to bind to a Notch protein expressed on the surface of a second cell. 

61 . A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of antibody to a Notch protein. 

62. The method according to claim 58 in which the antibody is 

monoclonal. 

63. A method for treating a patient with a tumor, of a tumor type 
characterized by expression of a Notch gene, comprising administering to the 
patient an effective amount of an oligonucleotide, which oligonucleotide (a) 
consists of at least six nucleotides; (b) comprises a sequence complementary to at 
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least a portion of an RNA transcript of the Notch gene; and (c) is hybridizable to 
the RNA transcript. 

64. The method according to claim 60 in which the patient is a 
5 human, and the Notch gene is a human gene. 

65. An isolated oligonucleotide consisting of at least six 
nucleotides, and comprising a sequence complementary to at least a portion of an 
RNA transcript of a Notch gene, which oligonucleotide is hybridizable to the 

10 RNA transcript. 

66. A pharmaceutical composition comprising the oligonucleotide 
of claim 65; and a pharmaceutical^ acceptable carrier. 

15 67. A method of inhibiting the expression of a nucleic acid 

sequence encoding a Notch protein in a cell comprising providing the cell with an 
effective amount of the oligonucleotide of claim 65. 

68. A method of diagnosing a disease or disorder characterized by 
20 an aberrant level of Notch protein or activity in a patient, comprising measuring 

the level of Notch protein expression or activity in a sample derived from the 
patient, in which an increase or decrease in Notch protein or activity in the 
patient sample relative to the level found in such a sample from a normal 
individual indicates the presence of the disease or disorder in the patient. 

25 

69. A method of diagnosing a malignancy characterized by an 
increased amou^z of a Notch protein or of a Notch derivative capable of being 
bound by an anti-Notch antibody, comprising measuring the amount of a Notch 
protein or of a Notch derivative capable of being bound by an anti-Notch 

30 antibody, in a sample containing or suspected of containing malignant cells from a 
patient, in which an increase in the amount of the Notch protein or of the Notch 



35 



WO 94/07474 



- 157 - 



PCT/US93/09338 



10 



derivative capable of being bound by an anti-Notch antibody , in the sample, 
relative to said amount found in an analogous sample of non-malignant cells 
indicates the presence of the disease or disorder in the patient. 

70. The method according to claim 69 in which the malignancy is 
cervical cancer. 



breast cancer. 



colon cancer. 



71. The method according to claim 69 in which the malignancy is 



72. The method according to claim 69 in which the malignancy is 



73. The method according to claim 69 in which the malignancy is 
15 selected from the group consisting of melanoma, seminoma, and lung cancer. 

74. The method according to claim 69 in which the amount of the 
Notch protein or derivative is measured by a method comprising contacting the 
sample with an anti-Notch antibody such that immunospecific binding can occur, 

20 and measuring the amount of any immunospecific binding of the antibody that 
occurs. 

75. A method of treating or preventing a nervous system disorder 
in a subject comprising administering to a subject in need of such treatment or 

25 prevention an effective amount of a functionally active portion of a Notch protein. 

76. A method of promoting tissue regeneration or repair in a 
subject comprising administering to a subject an effective amount of a functionally 
active portion of a Notch protein. 

30 
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77. A method of treating a benign dysproliferative disorder in a 
subject comprising administering to a subject in need of such treatment an 
effective amount of a functionally active portion of a Notch protein, in which the 
disorder is selected from the group consisting of cirrhosis of the liver, psoriasis, 
keloids, and baldness. 

78. A substantially purified human Notch protein comprising the 
amino acid sequence encoded by the hN homolog as depicted in Figure 13 from 
amino acid numbers 1 through 2169 (SEQ ID NO:19). 

79. A substantially purified human Notch protein comprising the 
amino acid sequence encoded by the hN homolog as depicted in Figure 13 from 
amino acid numbers about 26 through 2169 (as contained in SEQ ID NO: 19). 

15 80. A substantially purified protein comprising the extracellular 

domain of the mature human Notch protein encoded by the hN homolog, as 
depicted in Figure 13 from amino acid numbers about 26 through 1677 (as 
contained in SEQ ID NO: 19). 

20 81. A substantially purified protein comprising the EGF 

homologous repeats of the mature human Notch protein encoded by the hN 
homolog, as depicted in Figure 13 from amino acid numbers 26 through 1413 (as 
contained in SEQ ID NO: 19). 

25 82. A substantially purified protein comprising the EGF like 

repeats 1 1 and 12 of the mature human Notch protein encoded by the hN 
homolog, as depicted in Figure 13 (as contained in SEQ ID NO: 19). 

83. A substantially purified protein consisting essentially of the 
30 extracellular domain of the mature human Notch protein encoded by the hN 
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homolog, as depicted in Figure 13 from amino acid numbers about 26 through 
1677 (as contained in SEQ ID NO: 19). 

84. A substantially purified nucleic acid encoding the protein of 

5 claim 78. 

85. A substantially purified nucleic acid encoding the protein of 

claim 79. 

10 86. A substantially purified nucleic acid encoding the protein of 

claim 80. 

87. A substantially purified nucleic acid encoding the protein of 

claim 82. 

15 

88. The nucleic acid of claim 85 which is a DNA molecule 
comprising the sequence depicted in Figure 17 from nucleotide numbers 82 
through 7419 (as contained in SEQ ID NO:21). 

2 0 89. The nucleic acid of claim 80 in which the sequence encoding 

the extracellular domain is as presented in Figure 17 (as contained in 
SEQ ID NO:21). 

90. A recombinant cell containing the nucleic acid of claim 84, 87 

25 or 88. 

91. The composition of claim 2 in which the Notch protein 
comprises the amino acid sequence encoded by the hN homolog as depicted in 
Figure 13 from amino acid numbers 26 through 2169 (as contained in 

30 SEQ ID NO: 19). 
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92. A composition comprising a therapeutically effective amount 
of a Notch protein or Notch derivative, said derivative being capable of being 
bound by an anti-Notch antibody, for use as a medicament. 

5 93. A composition comprising a therapeutically effective amount 

of a molecule which antagonizes the function of a Notch protein, for use as a 
medicament. 

94. Use of a composition comprising a molecule which 
10 antagonizes the function of a Notch protein, for the manufacture of a medicament 
for the treatment of cervical cancer, breast cancer, or colon cancer. 



15 
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GAATTCGGAG GAATTATTCA AAACATAAAC ACAATAAAGA ATTTGAGTAG TTGCCGCACA 60 

CACACACACA CACAGCCCGT GGATTATTAC ACTAAAAGCG ACACTCAATC CAAAAAATCA 120 

GCAACAAAAA CATCAATAAA C ATG CAT TGG ATT AAA TGT TTA TTA ACA GCA 171 

Met His Trp He Lys Cys Leu Leu Thr Ala 
1 5 10 

TTC ATT TGC TTC ACA GTC ATC GTG CAG GTT CAC AGT TCC GGC AGC TTT 219 
Phe He Cys Phe Thr Vol lie Vol Gin Val His Ser Ser Gly Ser Phe 
15 20 25 

GAG TTG CGC CTG AAG TAC TTC AGC AAC GAT CAC GGG CGG GAC AAC GAG 267 
Glu Leu Arg Leu Lys Tyr Phe Ser Asn Asp His Gly Arg Asp Asn Glu 
30 ' 35 40 

GGT CGC TGC TGC AGC GGG GAG TCG GAC GGA GCG ACG GGC AAG TGC CTG 315 
Gly Arg Cys Cys Ser Gly Glu Ser Asp Gly Ala Thr Gly Lys Cys Leu 
45 50 55 

GGC AGC TGC AAG ACG CGG TTT CGC GTC TGC CIA AAG CAC TAC CAG GCC 363 
Gly Ser Cys Lys Thr Arg Phe Arg Val Cys Leu Lys His Tyr Gin Ala 
60 65 70 

ACC ATC GAC ACC AGC TCC CAG TGC ACC TAC GGG GAC GTG ATC ACG CCC 411 
Thr He Asp Thr Thr Ser Gin Cys Thr Tyr Gly Asp Vol He Thr Pro 
75 80 85 90 

ATT CTG GGC GAG AAC TCG GTC AAT CTG ACC GAC GCC GAG GGC TTC CAG 459 
He Leu Gly Glu Asn Ser Val Asn Leu Thr Asp Ala Gin Arg Phe Gin 
95 100 105 

AAC AAG GGC TTC ACG AAT CCC ATC CAG TTC CCC TTC TCG TTC TCA TGG 507 
Asn Lys Gly Phe Thr Asn Pro He Gin Phe Pro Phe Ser Phe Ser Trp 
110 115 120 
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CCG GGT ACC TTC TCG CTG ATC GTC GAG GCC TGG CAT GAT ACG AAC AAT 555 

Pro Gly Thr Phe Ser Leu lie Vol Glu Ala TrD His Asp Thr Asn Asn 
125 130 135 

AGC GGC AAT GCG CGA ACC AAC AAG CTC CTC ATC CAG CGA CTC TTG GTG 603 
Ser Gly Asn Ala Arg Thr Asn Lys Leu Leu He Gin Arg Leu Leu Val 
140 145 150 

CAG CAG GTA CTG GAG GTG TCC TCC GAA TGG AAG ACG AAC AAG TCG GAA 651 
Gin Gin Val Leu Glu Val Ser Ser Glu Trp Lys Thr Asn Lys Ser Glu 
155 160 165 170 

TCG CAG TAC ACG TCG CTG GAG TAC GAT TTC CGT GTC ACC TGC GAT CTC 699 
Ser Gin Tyr Thr Ser Leu Glu Tyr Asp Phe Arg Val Thr Cys Asp Leu 
175 180 185 

AAC TAC TAC GGA TCC GGC TGT GCC AAG TTC TGC CGG CCC CGC GAC GAT 747 
Asn Tyr Tyr Gly Ser Gly Cys Ala Lys Phe Cys Arg Pro Arg Asp Asp 
190 195 200 

TCA TTT GGA CAC TCG ACT TGC TCG GAG ACG GGC GAA ATT ATC TGT TTG 795 
Ser Phe Gly His Ser Thr Cys Ser Glu Thr Gly Glu He He Cys Leu 
205 210 215 

ACC GGA TGG CAG GGC GAT TAC TGT CAC ATA CCC AAA TGC GCC AAA GGC 843 
Thr Gly Trp Gin Gly Asp Tyr Cys His He Pro Lys Cys Ala Lys Gly 
220 225 230 

TGT GAA CAT GGA CAT TGC GAC AAA CCC AAT CAA TGC GTT TGC CAA CTG 891 
Cys Glu His Gly His Cys Asp Lys Pro Asn Gin Cys Val Cys Gin Leu 
235 ' 240 245 250 

GGC TGG AAG GGA GCC TTG TGC AAC GAG TGC GTT CTG GAA CCG AAC TGC 939 
Gly Trp Lys Gly Ala Leu Cys Asn Glu Cys Val Leu Glu Pro Asn Cys 
255 260 265 
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ATC CAT GGC ACC TGC AAC AAA CCC TGG ACT TGC ATC TGC AAC GAG GGT 987 

He His Gly Thr Cys Asn Lys Pro Trp Thr Cys lie Cys Asn Glu Gly 
270 275 280 

TGG GGA GGC TTG TAC TGC AAC CAG GAT CTG AAC TAC TGC ACC AAC CAC 1035 
Trp Gly Gly Leu Tyr Cys Asn Gin Asp Leu Asn Tyr Cys Thr Asn His 
285 290 295 

AGA CCC TGC AAG AAT GGC GGA ACC TGC TTC AAC ACC GGC GAG GGA TTG 1083 
Arg Pro Cys Lys Asn Gly Gly Thr Cys Phe Asn Thr Gly Glu Gly Leu 
300 305 310 

TAC ACA TGC AAA TGC GCT CCA GGA TAC AGT GGT GAT GAT TGC GAA AAT 1131 
Tyr Thr Cys Lys Cys Ala Pro Gly Tyr Ser Gly Asp Asp Cys Glu Asn 
315 320 325 330 

GAG ATC TAC TCC TGC GAT GCC GAT GTC AAT CCC TGC CAG AAT GGT GGT 1 179 
Glu He Tyr Ser Cys Asp Ala Asp VqI Asn Pro Cys Gin Asn Gly Gly 
335 340 ' 345 

ACC TGC ATC GAT GAG CCG CAC ACA AAA ACC GGC TAC AAG TGT CAT TGC 1227 
Thr Cys lie Asp Glu Pro His Thr Lys Thr Gly Tyr Lys Cys His Cys 
350 355 360 

GCC AAC GGC TGG AGC GGA AAG ATG TGC GAG GAG AAA GTG CTC ACG TGT 1275 
Ala Asn Gly Trp Ser Gly Lys Met Cys Glu Glu Lys Vol Leu Thr Cys 
365 370 375 

TCG GAC AAA CCC TGT CAT CAG GGA ATC TGC CGC AAC GTT CGT CCT GGC 1323 
Ser Asp Lys Pro Cys His Gin Gly He Cys Arg Asn Vol Arg Pro Gly 
380 385 ' 390 

TTG GGA AGC AAG GGT CAG GGC TAC CAG TGC GAA TGT CCC ATT GGC TAC 1371 
Leu Gly Ser Lys Gly Gin Gly Tyr Gin Cys Glu Cys Pro lie Gly Tyr 
395 400 405 410 



FIG.1C 



SUBSTITUTE SHEETfRULE 26) 



WO 94/07474 



PCT/LS93/09338 



4/68 

AGC GGA CCC AAC TGC GAT CTC CAG CTG GAC AAC TGC ACT CCG AAT CCA 1419 

Ser Gly Pro Asn Cys Asp Leu Gin Leu Asp Asn Cys Ser Pro Asn Pro 

415 420 425 

TGC ATA AAC GGT GGA AGC TGT CAG CCG AGC GGA AAG TGT ATT TGC CCA 1467 
Cys He Asn Gly Gly Ser Cys Gin Pro Ser Gly Lys Cys He Cys Pro 
430 435 440 

GCG GGA TTT TCG GGA ACG AGA TGC GAG ACC AAC ATT GAC GAT TGT CTT 1515 
Ala Gly Phe Ser Gly Thr Arg Cys Glu Thr Asn Me Asp Asp Cys Leu 
445 450 455 

GGC CAC CAG TGC GAG AAC GGA GGC ACC TGC ATA GAT ATG GTC AAC CAA 1563 
Gly His Gin Cys Glu Asn Gly Gly Thr Cys He Asp Met Vol Asn Gin 
460 465 470 

TAT CGC TGC CAA TGC GTT CCC GGT TTC CAT GGC ACC CAC TGT AGT AGC 1611 
Tyr Arg Cys Gin Cys Val Pro Gly Phe His Gly Thr His Cys Ser Ser 
475 480 ' 485 490 

AAA GTT GAC TTG TGC CTC ATC AGA CCG TGT GCC AAT GGA GGA ACC TGC 1659 
Lys Val Asp Leu Cys Leu I le Arg Pro Cys Ala Asn Gly Gly Thr Cys 
495 500 505 

TTG AAT CTC AAC AAC GAT TAC CAG TGC ACC TGT CGT GCG GGA TTT ACT 1707 
Leu Asn Leu Asn Asn Asp Tyr Gin Cys Thr Cys Arg Ala Gly Phe Thr 
510 515 520 

GGC AAG GAT TGC TCT GTG GAC ATC GAT GAG TGC AGC AGT GGA CCC TGT 1755 
Gly Lys Asp Cys Ser Val Asp He Asp Glu Cys Ser Ser Gly Pro Cys 
525 530 535 

CAT AAC GGC GGC ACT TGC ATG AAC CGC GTC AAT TCG TTC CAA TGC GTG 1803 
His Asn Gly Gly Thr Cys Met Asn Arg Val Asn Ser Phe Glu Cys Val 
540 ' ' 545 550 
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TGT GCC AAT GGT TTC AGG GGC AAG CAG TGC GAT GAG GAG TCC TAC GAT 1851 
Cys Ala Asn Gly Phe Arg Gly Lys Gin Cys Asp Glu Glu Ser Tyr Asp 
555 560 565 570 

TCG GTG ACC TTC GAT GCC CAC CAA TAT GGA GCG ACC ACA CAA.GCG AGA 1899 
Ser Val Thr Phe Asp Ala His Gin Tyr Gly Ala Thr Thr Gin Ala Arg 
575 580 585 

GCC GAT GGT TTG ACC AAT GCC CAG GTA GTC CTA ATT GCT GTT TTC TCC 1947 
Ala Asp Gly Leu Thr Asn Ala Gin Val Val Leu He Ala Val Phe Ser 
590 595 600 

GTT GCG ATG CCT TTG GTG GCG GTT ATT GCG GCG TGC GTG GTC TTC TGC 1995 
Val Ala Met Pro Leu Val Ala Val lie Ala Ala Cys Val Val Phe Cys 
605 610 615 

ATG AAG CGC AAG CGT AAG CGT GCT CAG GAA AAG GAC GAC GCG GAG GCC 2043 
Met Lys Arg Lys Arg Lys Arg Ala Gin Glu Lys Asp Asp Ala Glu Ala 
620 625 630 

AGG AAG CAG AAC GAA CAG AAT GCG GTG GCC ACA ATG CAT CAC AAT GGC 2091 
Arg Lys Gin Asn Glu Gin Asn Ala Val Ala Thr Met His His Asn Gly 
635 640 645 650 

AGT GGG GTG GGT GTA GCT TTG GCT TCA GCC TCT CTG GGC GGC AAA ACT 2139 
Ser Gly Val Gly Val Ala Leu Ala Ser Ala Ser Leu Gly Gly Lys Thr 
655 660 665 

GGC AGC AAC AGC GGT CTC ACC TTC GAT GGC GGC AAC CCG AAT ATC ATC 2187 
Gly Ser Asn Ser Gly Leu Thr Phe Asp Gly Gly Asn Pro Asn lie He 
670 675 680 

AAA AAC ACC TGG GAC AAG TCG GTC AAC AAC ATT TGT GCC TCA GCA GCA 2235 
Lys Asn Thr Trp Asp Lys Ser Val Asn Asn He Cys Ala Ser Ala Ala 
685 690 695 
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GCA GCG GCG GCG GCG GCA GCA GCG GCG GAC GAG TGT CTC ATG TAC GGC 2283 
Ala Ala Ala Ala Ala Ala Ala Ala Ala Asp Glu Cys Leu Met Tyr Gly 
700 705 710 

GGA TAT GTG GCC TCG GTG GCG GAT AAC AAC AAT GCC AAC TCA GAC TTT 2331 
Gly Tyr Val Ala Ser Val Ala Asp Asn Asn Asn Ala Asn Ser Asp Phe 
715 720 725 730 

TGT GTG GCT CCG CTA CAA AGA GCC AAG TCG CAA AAG CAA CTC AAC ACC 2379 
Cys Val Ala Pro Leu Gin Arg Ala Lys Ser Gin Lys Gin Leu Asn Thr 
735 ' 740 745 

GAT CCC ACG CTC ATG CAC CGC GGT TCG CCG GCA GGC AGC TCA GCC AAG 2427 
Asp Pro Thr Leu Met His Arg Gly Ser Pro Ala Gly Ser Ser Ala Lys 
750 755 760 

GGA GCG TCT GGC GGA GGA CCG GGA GCG GCG GAG GGC AAG AGG ATC TCI 2475 
Gly Ala Ser Gly Gly Gly Pro Gly Ala Ala Glu Gly Lys Arg lie Ser 
765 ' 770 775 

GTT TTA GGC GAG GGT TCC TAC TGT AGC CAG.CGT TGG CCC TCG TTG GCG 2523 
Val Leu Gly Glu Gly Ser Tyr Cys Ser Gin Arg Trp Pro Ser Leu Ala 
780 ' * 785 790 

GCG GCG GGA GTG GCC GGA GCC TGT TCA TCC CAG CTA ATG GCT GCA GCT 2571 
Ala Ala Gly Val Ala Gly Ala Cys Ser Ser Gin Leu Met Ala Ala Ala 
795 800 805 810 

TCG GCA GCG GGC AGC GGA GCG GGG ACG GCG CAA CAG CAG CGA TCC GTG 2619 
Ser Ala Ala Gly Ser Gly Ala Gly Thr Ala Gin Gin Gin Arg Ser Val 
815 820 825 

GTC TGC GGC ACT CCG CAT ATG TAACTCCAAA AATCCGGAAG GGCTCCTGGT 2670 
Val Cys Gly Thr Pro His Met 
830 

AAATCCGGAG AAATCCGCAT GGAGGAGCTG ACAGCACATA CACAAAGAAA AGACTGGGTT 2730 

GGGTTCAAAA TGTGAGAGAG ACGCCAAAAT GTTGTTGTTG ATTGAAGCAG TTTAGTCGTC 2790 

ACGAAAAATG AAAAATCTGT AACAGGCATA ACTCGTAAAC TCCCTAAAAA ATTTGTATAG 2850 

TAATTAGCAA AGCTGTGACC CAGCCGTTTC GATCCCGAAT TC 2892 
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1 GAATTCCGCT GGGA6AATGG TCTGAGCTAC CTGCCCGTCC TGCTGGGGCA TCAATGGCAA 

61 GTGGGGAAAG CCAGACTGGG CAAAGGGGCC AGGCCATTTC TGGAATGTGG TACATGGTGG 

121 GCAGGGGGCC CGCAACAGCT GGAGGGCAGG TGGACTGAGG CTGGGGATCC CCCGCTGGTT 

181 GGGCAATACT GCCTTTACCC ATGAGCTGGA AAGTCACAAT GGGGGGCAAG GGCTCCCGAG 

241 GGTGGTTATG TGCTTCCTTC AGGTGGC 



FIG. 8 A 



1 GAATTCCTTC CATTATACGT GACTTTTCTG AAACTGTAGC CACCCTAGTG TCTCTAACTC 

61 CCTCTGGAGT TTGTCAGCTT TGGTCTTTTC AAAGAGCAGG CTCTCTTCAA GCTCCTTAAT 

121 GCGGGCATGC TCCAGTTTGG TCTGCGTCTC AAGATCACCT TTGGTAATTG ATTCTTCTTC 

181 AACCCGGAAC TGAAGGCTGG CTCTCACCCT CTAGGCAGAG CAGGAATTCC GAGGTGGATG 

241 TGTTAGATGT GAATGTCCGT GGCCCAGATG GCTGCACCCC ATTGATGTTG GCTTCTCTCC 

301 GAGGAGGCAG CTCAGATTTG AGTGATGAAG ATGAAGATGC AGAGGACTGT TCTGCTAACA 

361 TCATCACAGA CTTGGTCTAC CAGGGTGGCA GCCTCCAGAC CAGACAGACC GGACTGGTGA 

421 GATGGCCCTG CACCTTGCAG CCCGCTACTC ACGGGCTGAT GCTGCCAAGC GTCTCCTGGA 

481 TGCAGGTGGA GATGCCAATG GCCAGGAGAA CATGGGCCGC TGTCCACTCC ATGCTGCAGT 

541 GGCACGTGAT GCCAAGGTGT ATTCAGATCT GTTA 



FIG,8B 



1 TCCAGATTCT GATTCGCAAC CGAGTAACTG ATCTAGATGC CAGGATGAAT GATGGTACTA 

61 CACCCCTGAT CCTGGCTGCC CGCCTGGCTG TGGAGGGAAT GGTGGCAGAA CTGATCAACT 

121 GCCAAGCGGA TGTGAATGCA GTGGATGACC ATGGAAAATC TGCTCTTCAC TGGGCAGCTG 

181 CTGTCAATAA TGTGGAGGCA ACTCTTTTGT TGTTGAAAAA TGGGGCCAAC CGAGACATGC 

241 AGGACAACAA GGAAGAGACA CCTCTGTTTC TTGCTGCCCG GGAGGAGCTA TAAGC 



FIG.8C 
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1 GAATTCCATT CAGGAGGAAA GGGTGGGGAG AGAAGCAGGC ACCCACTTTC CCGTGGCTGG 

61 ACTCGTTCCC AGGTGGCTCC ACCGGCAGCT GTGACCGCCG CAGGTGGGGG CGGAGTGCCA 

121 TTCAGAAAAT TCCAGAAAAG CCCTACCCCA ACTCGGACGG CAACGTCACA CCCGTGGGTA 

181 GCAACTGGCA CACAAACAGC CAGCGTGTCT GGGGCACGGG GGGATGGCAC CCCCTGCAGG 

241 CAGAGCTG 



1 



FIG.9A 



, CTAAAGGGAA CAAAAGCNGG AGCTCCACCG CGGGCGGCNC NGCTCTAGAA CTAGTGGANN 

61 NCCCGGGCTG CAGGAATTCC GGCGGACTGG GCTCGGGCTC AGAGCGGCGC TGTGGAAGAG 

121 ATTCTAGACC GGGAGAACAA GCGAATGGCT GACAGCTGGC CTCCAAAGTC ACCAGGCTCA 

181 AATCGCTCGC CCTGGACATC GAGGGATGCA GAGGATCAGA ACCGGTACCT GGATGGCATG 

24 1 ACTCGGATTT ACAAGCATGA CCAGCCTGCT TACAGGGAGC GTGANNTTTT CACATGCAGT 

301 CGACAGACAC GAGCTCTATG CAT 



FIG.9B 
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G GAG GTG GAT GTG TTA GAT GTG AAT GTC CGT GGC CCA GAT GGC TGC 46 
Glu Val Asp Vol Leu Asp Val Asn Val Arg Gly Pro Asp Gly Cys 
1 ' 5 10 15 

ACC CCA TTG ATG TTG GCT TCT CTC CGA GGA GGC AGC TCA GAT TTG AGT 94 
Thr Pro Leu Met Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser 
20 25 50 

GAT GAA GAT GAA GAT GCA GAG GAC TCT TCT GCT AAC ATC ATC ACA GAC 142 
Asp Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala Asn He He Thr Asp 
35 40 45 

TTG GTC TAC CAG GGT GCC AGC CTC CAG GCC CAG ACA GAC CGG ACT GGT 190 
Leu Vol Tyr Gin Gly Ala Ser Leu Gin Ala Gin Thr Asp Arg Thr Gly 
50 55 60 

GAG ATG GCC CTG CAC CTT GCA GCC CGC TAC TCA CGG GCT GAT GCT GCC 238 
Glu Met AIq Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala 
65 70 ^ 75 

AAG CGT CTC CTG GAT GCA GGT GCA GAT GCC AAT GCC CAG GAC AAC ATG 286 
Lys Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gin Asp Asn Met 
80 85 90 95 

GGC CGC TGT CCA CTC CAT GCT GCA GTG GCA GCT GAT GCC CAA GGT GTC 334 
Gly Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gin Gly Val 
100 105 110 

TTC CAG ATT CTG ATT CGC AAC CGA GTA ACT GAT CTA GAT GCC AGG ATG 382 
Phe Gin lie Leu He Arg Asn Arg Val Thr Asp Leu Asp Ala Arg Met 
115 120 125 

AAT GAT GGT ACT ACA CCC CTG ATC CTG GCT GCC CGC CTG GCT GTG GAG 430 
Asn Asp Gly Thr Thr Pro Leu He Leu Ala Ala Arg Leu Ala Val Glu 
130 135 140 



FIG. 1 1 A 
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GGA ATG GTG GCA GAA CTG ATC AAC TGC CAA GCG GAT GTG AAT GCA GTG 478 
Gly Met Val Ala Glu Leu lie Asn Cys Gin Ala Asp Val Asn Ala Val 
145 150 155 

GAT GAC CAT GGA AAA TCT GCT CTT CAC TGG GCA GCT GCT GTC AAT AAT 526 
Asp Asp His Gly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn 
160 165 170 175 

GTG GAG GCA ACT CTT TTG TTG TTG AAA AAT GGG GCC AAC CGA GAC ATG 574 
Val Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly Ala Asn Arg Asp Met 
180 185 ' 190 

CAG GAC AAC AAG GAA GAG ACA CCT CTG TTT CTT GCT GCC CGG GAG GGG 622 
Gin Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly 
195 200 205 

AGC TAT GAA GCA GCC AAG ATC CTG TTA GAC CAT TTT GCC AAT CGA GAC 670 
Ser Tyr Glu Ala Ala Lys He Leu Leu Asp His Phe Ala Asn Arg Asp 
210 215 220 

ATC ACA GAC CAT ATG GAT CGT CTT CCC CGG GAT GTG GCT CGG GAT CGC 718 
He Thr Asp His Met Asp Arg Leu Pro Arg Asp Val Ala Arg Asp Arg 
225 230 235 

ATG CAC CAT GAC ATT GTG CGC CTT CTG GAT GAA TAC AAT GTG ACC CCA 766 
Met His His Asp lie Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro 
240 245 250 255 

AGC CCT CCA GGC ACC GTG TTG ACT TCT GCT CTC TCA CCT GTC ATC TGT 814 
Ser Pro Pro Gly Thr Val Leu Thr Ser Ala Leu Ser Pro Val He Cys 
260 265 270 

GGG CCC AAC AGA TCT TTC CTC AGC CTG AAG CAC ACC CCA ATG GGC AAG 862 
Gly Pro Asn Arg Ser Phe Leu Ser Leu Lyn His Thr Pro Met Gly Lys 
275 280 285 
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AAG TCT AGA CGG CCC AGT GCC AAG AGT ACC ATG CCT ACT AGC CTC CCT 910 

Lys Ser Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro 

290 " 295 300 

AAC CTT GCC AAG GAG GCA AAG GAT GCC AAG GGT AGT AGG AGG AAG AAG 958 

Asn Leu Ala Lys Glu Alo Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys 
305 310 315 

TCT CTG AGT GAG AAG GTC CAA CTG TCT GAG AGT TCA GTA ACT TTA TCC 1006 

Ser Leu Ser Glu Lys Val Gin Leu Ser Glu Ser Ser Vol Thr Leu Ser 
320 325 330 335 

CCT GTT GAT TCC CTA GAA TCT CCT CAC ACG TAT GTT TCC GAC ACC ACA 1 054 

Pro Val Asp Ser Leu &lu Ser Pro His Thr Tyr Val Ser Asp Thr Thr 
340 345 350 

TCC TCT CCA ATG ATT ACA TCC CCT GGG ATC TTA CAG GCC TCA CCC AAC 1102 

Ser Ser Pro Met lie Thr Ser Pro Gly He Leu Gin Ala Ser Pro Asn 
355 360 365 

CCT ATG TTG GCC ACT GCC GCC CCT CCT GCC CCA GTC CAT GCC CAG CAT 1150 

Pro Met Leu Ala Thr Ala Alo Pro Pro Ala Pro Val His Ala Gin His 

370 375 380 

GCA CTA TCT TTT TCT AAC CTT CAT GAA ATG CAG CCT TTG GCA CAT GGG 1198 

Ala Leu Ser Phe Ser Asn Leu His Glu Met Gin Pro Leu Ala His Gly 
385 390 395 

GCC AGC ACT GTG CTT CCC TCA GTG AGC CAG TTG CTA TCC CAC CAC CAC 1246 

Ala Ser Thr Val Leu Pro Ser Val Ser Gin Leu Leu Ser His His His 
400 405 410 415 

ATT GTG TCT CCA GGC AGT GGC AGT GCT GGA AGC TTG AGT AGG CTC CAT 1294 

I le Val Ser Pro Gly Ser Gly Ser Ala Gly Ser Leu Ser Arg Leu His 
420 425 430 

CCA GTC CCA GTC CCA GCA GAT TGG ATG AAC CGC ATG GAG GTG AAT GAG 1342 

Pro Val Pro Val Pro Ala Asp Trp Met Asn Arg Met Glu Val Asn Glu 
435 440 445 

FIG.11C 
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ACC CAG TAC AAT GAG ATG TTT GGT ATG GTC CTG GCT CCA GCT GAG GGC 1390 
Thr Gin Tyr Asn Glu Met Phe Gly Met Val Leu Ala Pro Ala Glu Glv 
450 455 460 

ACC CAT CCT GGC ATA GCT CCC CAG AGC AGG CCA CCT GAA GGG AAG CAC 1438 
Thr His Pro Gly He AIq Pro Gin Ser Arg Pro Pro Glu Gly Lys His 
465 470 " 475 

ATA ACC ACC CCT CGG GAG CCC TTG CCC CCC ATT GTG ACT TTC CAG CTC 1486 
He Thr Thr Pro Arg Glu Pro Leu Pro Pro He Val Thr Phe Gin Leu 
480 485 490 495 

ATC CCT AAA GGC AGT ATT GCC CAA CCA GCG GGG GCT CCC CAG CCT CAG 1534 
He Pro Lys Gly Ser He Ala Gin Pro AIq Gly AIq Pro Gin Pro Gin 
500 505 510 

TCC ACC TGC CCT CCA GCT GTT GCG GGC CCC CTG CCC ACC ATG TAC CAG 1582 
Ser Thr Cys Pro Pro Ala Val AIq Gly Pro Leu Pro Thr Met Tyr Gin 
515 520 525 

ATT CCA GAA ATG GCC CGT TTG CCC AGT GTG GCT TTC CCC ACT GCC ATG 1630 
He Pro Glu Met Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met 
530 535 540 

ATG CCC CAG CAG GAC GGG CAG GTA GCT CAG ACC ATT CTC CCA GCC TAT 1678 
Met Pro Gin Gin Asp Gly Gin Val Ala Gin Thr He Leu Pro Ala Tyr 
545 550 555 

CAT CCT TTC CCA GCC TCT GTG GGC AAG TAC CCC ACA CCC CCT TCA CAG 1726 
His Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr Pro Pro Ser Gin 
560 565 570 575 

CAC AGT TAT GCT TCC TCA AAT GCT GCT GAG CGA ACA CCC AGT CAC AGT 1774 
His Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser His Ser 
580 585 * 590 

GGT CAC CTC CAG GGT GAG CAT CCC TAC CTG ACA CCA TCC CCA GAG TCT 1822 
Gly His Leu Gin Gly Glu His Pro Tyr Leu Thr Pro Ser Pro Glu Ser 
595 600 605 

FIG.11D 
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CCT GAC CAG TGG TCA AGT TCA TCA CCC CAC TCT GCT TCT GAC TGG TCA 1870 

Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser AIq Ser Asp Trp Ser 
610 615 620 

GAT GTG ACC ACC AGC CCT ACC CCT GGG GGT GCT GGA GGA GGT CAG CGG 1918 

Asp Vat Thr Thr Ser Pro Thr Pro Gly Gly Ala Gly Gly Gly Gin Arg 
625 630 635 

GGA CCT GGG ACA CAC ATG TCT GAG CCA CCA CAC AAC AAC ATG CAG GTT 1966 

Gly Pro Gly Thr His Met Ser Glu Pro Pro His Asn Asn Met Gin Vol 
640 645 650 655 

TAT GCG TGAGAGAGTC CACCTCCAGT GTAGAGACAT AACTGACTTT TGTAAATGCT 2022 
Tyr AIq 



GCTGAGGAAC 


AAATGAAGGT 


CATCCGGGAG 


AGAAATGAAG 
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TTACCAGGAA GAAGGGTGTG AGTTTGTTGT TTTTCTGTGT ATGGGCCTGG TCAGTGTAAA 2862 

GTTTTATCCT TGATAGTCTA GTTACTATGA CCCTCCCCAC TTTTTTAAAA CCAGAAAAAG 2922 

GTTTGGAATG TTGGAATGAC CAAGAGACAA GTTAACTCGT GCAAGAGCCA GTTACCCACC 2982 

CACAGGTCCC CCTACTTCCT GCCAAGCATT CCATTGACTG CCTGTATGGA ACACATTTGT 3042 

CCCAGATCTG AGCATTCTAG GCCTGTTTCA CTCACTCACC CAGCATATGA AACTAGTCTT 3102 

AACTGTTGAG CCTTTCCTTT CATATCCACA GAAGACACTG TCTCAAATGT TGTACCCTTG 3162 

CCATTTAGGA CTGAACTTTC CTTAGCCCAA GGGACCCAGT GACAGTTGTC TTCCGTTTGT 3222 

CAGATGATCA GTCTCTACTG ATTATCTTGC TGCTTAAAGG CCTGCTCACC AATCTTTCTT 3282 

TCACACCGTG TGGTCCGTGT TACTGGTATA CCCAGTATGT TCTCACTGAA GACATGGACT 3342 

TTATA7GTTC AAGTGCAGGA ATTGGAAAGT TGGACTTGTT TTCTATGATC CAAAACAGCC 3402 

CTATAAGAAG GTTGGAAAAG GAGGAACTAT ATAGCAGCCT TTGCTATTTT CTGCTACCAT 3462 

TTCTTTTCCT CTGAAGCGGC CATGACATTC CCTTTGGCAA CTAACGTAGA AACTCAACAG 3522 
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Potentiol signol cleovoge site-^ 



hum N MP ALRPAl LWALLALWLC CA APA HA- 

TAN-1 MP PL LAPLLCLALL PA LAA RG- 

Xen N md RIGLAVLLCS LP VLT QG- 



Dros N MQSQRSRRRS RAPNTWICFW INKMHAVASL PASLPLLLLT LAFANLPNIV RGTDTALVAA. 

hum N i MLGKATCRCA SGFTGEDCQY STSHPCFVSR PCLNGGTCHM LSRDT-YECT COVGFTGKEC 

Ton-1 i GVAOYACSCA LGFSGPLCLT PLDNAC-LTN PCRNGGTCDL LT-LTEYKCR CPPGWSGKSC )>- 

Xen N i NA10FICHCP VGFTDKVCLT PVDNAC-VNN PCRNGGTCEL LNSVTEYKCR CPPGWTGDSC 

Dros N | GRPG1SCKCP LGFDESLCE1 AVPNAC-DHV TCLNGGTCQL KT-LEEYTCA CANGYTCERC 

hum N i NLPGSYQCQC PQGFTGQYCD SLYVPCAPSP CVNGGTCRQT GDFTFECNCL PGFEGSTCER 

TAN-1 : NEVGSYRCVC RATHTGPNCE RPYVPCSPSP CQMGGTCRPT GDVTHECACL PGFTGQNCEE 

Xen N • NEFGSYRCTC QNRFTGRNCD EPYVPCNPSP CLNGGTCRQT DDTSYDCTCL PGFSGQNCEE 

Dros N NTHGSYQCMC PTGYTGKDCD TKYNPCSPSP CGNAG1CRSN G-LSYECKCP KGFEGKNCEO 



J 



r 



EGF-like Repeots 



QCRDGYEPCV NEGMCVTYHN GTGYCKCPEG FLGEYCQHRO PCE-KNRCQN GGTC— VAOA 83 

RCSQPGETCL NGGKCEA-AN GTEACVCGGA FVGPRCQDPN PCL-STPCKN AGTCHWDRR 80 

RCTQTAEMCL NGGRCEMTPG GTGVCLCGNL YFGERCQFPN PCTIKNQCMN FGTCEPVLQG 90 

SCTSVG-CQ NGGTCVTQLN GKTYCACDSH YVGDYCEHRN PCN-SMRCQN GGTCQVTFRN 117 

GWTDACLSHP CANGSTCTTV — ANQFSCKC LTGFTGQKCE TDVNEC-DIP GHCQHGGTCL 199 

QOADPCASNP CANGGQCLPF — EASYICHC PPSFHGPTCR ODVNECGQKP RLCRHGGTCH 196 

QQADPCASNP CANGGKCLPF — EIQYICKC PPGFHGATCK QOINEC-S-0 NPCKNGGQC1 195 

ETKNLCASSP CRNGATCTAL AGSSSFTCSC PPGFTGDTCS YD1EEC-Q-S NPCKYGGICV 233 

NIDOCPNHRC QNGGVCVDGV NTYNCRCPPQ WTGQFCTEDV DECLLQPNA- CQNGGTCANR 318 

NIDDCPGNNC KNGGACVOGV NTYNCPCPPE WTGQYCTEDV DECQLMPNA- CQNGGTCHNT 315 

NIDDCPSNNC RNGGTCVOGV NTYNCQCPPD WTGQYCTEDV DECQLMPNA- CQNGGTCHNT 314 

NYDDCLGHLC QNGGTCIDGI SDYTCRCPPN FTGRFCQDDV DECAQRDHPV CQNGATCTNT 352 
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iNGGYGCVCVN GWSGDDCSEN IDDCAFASCT PGSTCIDRVA SFSCMCPEGK AGLLCHLDDA^ 
iHGGYNCVCVN GWTGEDCSEN IDDCASAACF HGATCHDRVA SFYCECPHGR TGLLCHLNDA 
YGGYNCVCVN GWTGEDCSEN IDDCANAACH SGATCHDRVA SFYCECPHGR TGLLCHLDNA 
iHGSYSCICVN GWAGLDCSNN TDDCKQAACF YGATCIDGVG SFYCQCTKGK TGLLCHLDDA 



AFHCECLKGY AGPRCEMDIN ECHSDPCQND ATCLDKIGGF TCLCMPGFKG VHCELEINEC 
SFECQCLQGY TGPRCEIDVN ECVSNPCOND ATCLDQIGEF QCMCMPGYEG VHCEVNTDEC 
SFQCNCPQGY AGPRCEIDVN ECLSNPCQND STCLDOIGEF QCICMPGYEG LYCETNIDEC 
SYRCNCSQGF TGPRCETNIN ECESHPCONE GSCLDDPGTF RCVCMPGFTG TKEIDIDEC 



hum N ATGFTGVLCE EN1DNCDPDP CHHGXQDGI DSYTC1CNPG YMGAICSDOI DECYSSPCLN 

TAN-1 TEGYTGTHCE VDIDECDPDP CHYGSCKDGV ATFTCLCRPG YTGHHCETNI NECSSOPCRL 

Xen N TEGFTGRHCE QD1NECIPDP CHYGTCKDGI ATFTCLCRPG YTGRLCDND1 NECLSKPCLN 

Dros N PPGYTGTSCE ININDCDSNP CHRGKCIDDV NSFKCLCDPG YTGYICOKOI NECESNPCOF 



L. 



r. 



CISNPCHKGA LCDTNPLNGQ YICTCPQGYK GAXTEDVDE CAMANSNPCE HAGKCVNTDG '. 438 

CISNPCNEGS NCOTNPVNGK AICTCPSGYT GPACSQDVDE CSLG-ANPCE HAGKCINTLG I 434 

CISNPCNEGS NCOTNPVNGK A1CTCPPGYT GPACNNDVDE CSLG-ANPCE HGGRCTNTLG • 433 

CTSNPCHADA 1CDTSP1NGS YACSCATGYK GVDCSED1DE CDQG-SPCE HNGICVNTPG ; 470 

QSNPCVNNGQ CVDKVNRFX LCPPGFTGPV COIOIDDCSS TPCLNGAKCI DHPNGYECQC 558 

ASSPCLHNGR CLDKINEFQC ECPTGFTGHL CQYDVDECAS TPCKNGAKCL DGPNTYTCVC 554 

ASNPCLHNGK CIDKINEFRC DCPTGFSGNL CQHDFDECTS TPCKNGAKCL DGPNSYTCOC 553 

QSNPCLNDGT CHDKINGFKC SCALGFTGAR CQ1N1DDCQS QPCRNRGICH DS1AGYSCEC 590 

DGRCIDLVNG YQCNCQPGTS GVNCEINFDD CASNPCIHG- ICMDGINRYS CVCSPGFTGQ 677 

RGTCQDPDNA YLCFCLKGTT GPNCEINLDD CASSPCDSG- TCLDKIDGYE CACEPGYTGS 673 

GGXTDRENG YICTCPKGTT GVNCETKIDD CASNLCDNG- KCIDKIOGYE CTCEPGYTGK 672 

DGHCODRVGS YYCQCQAGTS GKNCEVNVNE CHSNPCNNGA TCIOGINSYK CQCVPGFTGO 710 
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hum N ! RCNIDIDECA SNPCRKGATC INGVNGFRCI CPEGPHHPSC YSQVNECLSN PCI-HGNCTG 

TAN— 1 i MCNSNIDECA GNPCHNGGTC EDGINGFTCR CPEGYHDPTC LSEVNECNSN PCV-HGACRD 

Xen N ; LCN1NINECD SNPCRNGGTC KDQ1NGFTCV CPDGYHDHMC LSEVNECNSN PCI-HGACHD 

Dros N j HCEKNVDECI 5SPCANNGVC IDQVNGYKCE CPRGFYDAHC LSDVDECASN PCVNEGRCED 



hum N 1 DECASNPCLN QGTCFDDISG YTCHCVLPYT GKNCQTVLAP CSPNPCENAA VCKESPNFES 

TAN— 1 ! NECASNPCLN KGTCIDDVAG YKCNCLLPYT GATCEWLAP CAPSPCRNGG ECRQSEDYES 

Xen N j NECSSNPCLN HGTCIODVAG YKCNCMLPYT GAICEAVLAP CAGSPCKNGG RCKESEDFET 

Dros N ! DDCVTNPCGN GGTC1DKVNG YKCVCKVPFT GRDCESKMDP CASNRCKNEA KCTPSSNFLD 



hum N CLANPCQNGG SCMDGVNTFS CLCLPGFTGD KCQTDNMECL SEPCKNGGTC SDYVNSYTCK 

TAN-1 ! CRPNPCHNGG SCTDGINTAF CDCLPGFRGT FCEEDINECA SDPCRNGANC TDCVOSYTCT 

Xen N CQPNPCHNGG SCSDGINMFF CNCPAGFRGP KCEEDINECA SNPCKNGANC TDCVNSYTCT 

Dros N : CASFPCQNGG TCLDGIGDYS CLCVDGFDGK HCETDINECL SQPCQNGATC SQYVNSYTCT 



| 

1 ' GLSGYKCLCD AGWVGINCEV DKNECLSNPC QNGGTCDNLV NGYRCTCKKG FKGYNCQVNI 796 

SLNGYKCDCD PGWSGTNCDI NNNECESNPC VNGGTCKDMT SGIVCTCREG FSGPNCQTNI 792 

GVNGYKCDCE AGWSGSNCDI NNNECESNPC MNGGTCKDMT GAYICTCKAG FSGPNCQTNI 791 

G1NEFICHCP PGYTGKRCEL DIDECSSNPC QHGGTCYDKL NAFSCQCMPG YTGQKCETNI 830 



YTCLCA-PGW CCQRCTIDID EC-ISKPCMN HGLCHNTQGS YMCECPPGFS GMDCEEDIDD 914 

FSCVCPTAGA KGQTCEVDIN EC-VLSPCRH GASCQNTHGG YRCHCQAGYS GRNCETDIDD 911 

FSCECP-PGW QGQTCEIDMN EC-VNRPCRN GATCONTNGS YKCNCKPGYT GRNCEMDIDD 909 

FSCTCK-LGY TGRYCDED1D ECSLSSPCRN GASCLNVPGS YRCLCTKGYE GRDCAINTDD 949 



COAGFDGVHC ENN1NECTES SCFNGGTCVD GJNSFSCLCP VGFTGSFCLH EINECSSHPC 1034 
CPAGFSGIHC ENNTPDCTES SCFNGGTCVD GINSFTCLCP PGFTGSYCQH WNECDSRPC 1031 
CQPGFSGIHC ESNTPDCTES SCFNGGTCID GINTFTCQCP PGFTGSYCQH DINECDSKPC 1029 
CPLGFSGINC QTNDEDCTES SCLNGGSCID GINGYNCSCL AGYSGANCQY KLNKCDSNPC 1069 
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hum N ; LNEGTCVDGL GTYRCSCPLG YTGKNCOTLV NLCSRSPCKN KGTCVOKKAE SQCLCPSGWA 

TAN-1 LLGGTCQDGR GLHRCTCPQG YTGPNCONLV HWCOSSPCKN GGKCWQTHTQ YRCECPSGWT 

Xen N i LNGG7CQDSY GTYKCTCPQG YTGLNCONLV RWCDSSPCKN GGKCWQTNNF YRCECKSGWT 

Dros N I LNGATCHEQN NEYTCHCPSG FTGKQCSEYV DWCGQSPCEN GATCSQMKHQ FSCKCSAGWT 

hum N iSNPCQHGATC SDF IGGYRCE CVPGYQGVNC EYEVDECQNO PCONGGTCID LVNHFKCSCP 

TAN-1 IPSPCQNGATC TDYLGGYSCK CVAGYHGVNC SEEIDECLSH PCQNGGTCLO LPNTYKCSCP 

Xen N I PNPCQNGATC TDYLGGYSCE CVAGYHGVNC SEEINECLSH PCONGGTCID LINTYKCSCP f 

Dros N iSQPCQNGGTC RDLIGAYECQ CRQGFQGQNC ELN1DDCAPN PCQNGCTCHD RVMNFSCSCP 

hum N iCLSNPCSSEG SLDCIQLTND YLCVCRSAFT GRHCETFVDV CPGMPCLNGG TCAVASNMPD 

TAN-1 CLSNPCOARG TQNCVQRVND FHCECRAGHT GRRCESVING CKGKPCKNGG TCAVASNTAR 

Xen N CLSNPCDSRG TQNC1QLVND YRCECROGFT GRRCESWDG CKGMPCRNGG TCAVASNTER 

Dros N ,'CLSNPCSNAG TLDCVQLVNN YHCNCRPGHM GRHCEHKVDF CAOSPCONGG NCNI — RQS 



J 



r 



GAYCDVPNVS CDIAASRRGV LVEHLCQHSG VCINAGNTHY CQCPLGYTGS YCEEQLDECA 1154 

GLYCDVPSVS CEVAAQRQGV OVARLCQHGG LCVDAGNTHH CRCOAGYTGS YCEDLVDECS 1151 

GVYCDVPSVS CEVAAKQQGV OIVHLCRNSG MCVDTGNTHF CRCOAGYTGS YCEEQVDECS 1149 

GKLCDVQT1S CQDAADRKGL SLRQLC-NNG TCKDYGNSHV CYCSCGYAGS YCQKE1DECQ 1188 

PGTRGLLCEE NIDOCAR GPHCLN GGQCMDRIGG YSCRCLPGFA GERCEGDINE 1267 

RGTQGVHCE I NVDDCNPPVD PVSRSPKCFN NGTCVDQVGG YSCTCPPGFV GERCEGDVNE 1271 

RGTQGVHCEI NVDDCTPFYD SFTLEPKCFN NGKCIDRVGG YNCICPPGFV GERCEGDVNE 1269 

PGTMG11CE1 NKDDCKP GACHN NGSCIDRVGG FECVCQPGFV GARCEGDINE 1300 

GFICRCPPGF SGARCQS — SCGOVKCRKG EXVHTAS- GPRCFCPSP- — RDCES — 1376 

GFICKCPAGF EGATCENDAR TCGSLRCLNG GTCISGPR- SPTCLCLGPF TGPECQFPAS 1389 

GFICKCPPGF DGATCEYDSR TCSNLRCQNG GTCISVLT- SSKCVCSEGY TGATCQYPVI 1387 

y GHHC1CNNGF YGKNCELSGQ DCDSNPCRVG -NCWADEGF GYRCECPRGT LGEHCEIDTL 1415 



FIG. 1 3D 



DOCID: <WO 9407474A1_I_> 



SUBSTITUTE SHEET (RULE 26) 



WO 94/07474 



PCT/US93/09338 



48/68 



hum N 
TAN-1 
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-GC-ASSPCQ HGGSCHPQRQ PPYYSCQCAP PFSGSRCEL- -YTAPP S TPP 

SPCLGGNPCY NQGTCEPTSE SPFYRCLCPA KFNGLLCHIL DYSFGG GAGRDIPPP 

SPC-ASHPCY NGGTCQFFAE EPFFQCFCPK NFNBLFCHI.L DYEFPG GLGKNITPP 

.'DEC-SPNPCA QGAACEDLLG D— YECLCPS KWKGKRCDIY DANYPGWNGG SGSGNDRYAA 

NN-QCDELCN TVECLFDNFE CQGNSKTCK- -YDKYCADHF KDNHCNQGCN SEECGWDGLD 
SDGHCDSQCN SAGCLFDGFD CGRAEGQCNP LYDOYCKDHF SDGHCDQGCN SAECEWDGLD 
NDGKCDSQCN NTGCLYDGFD COKVEVQCNP LYDOYCKDHF QDGHCDQGCN NAECEWDGLD 
KNGKCNEECN NAACHYDGHD CERKLKSCDS LFDAYCQKHY GDGFCDYGCN NAEC SWDGLD 

YYGEKSAAMK KO-R 



MTRRSL PGEQ E QEVAGSKVFL 

YYGREEELRK HPIKRAAEGW AAPDALLGQV KASLLPGGSE GGRRRRELDP MDVRGSIVYL 

YYGNEEELKK HHIKRSTDYW SDAPSAI FSTMKESIL LGRHRRELDE MEVRGSIVYL 

WKDNVRVPEI EDTDFARKNK ILYTQQVHQ TGIQIYL 



r 



J 



LNR (Notch/Lin-12 Repeots) 



r 



— A — TCL SQYCADKARD GVCDEACNSH ACQWDGGDCS LTMENPWANC SSPLPCWDYI 1476 

LIEE — ACE LPECQEDAGN KVCSLOCNNH ACGWDGGDCS LNFNDPWKNC TQSLQCWKYF 1501 

DNDD— ICE NEXSELADN KVCMANCNNH ACGWDGGDCS LNFNDPWKNC TQSLQCWKYF 1498 

DLEQQRA MCD KRGCTEKQGN GICDSDCNTY ACNFDGNDCS LG1-NPWANC TAN-EXWNKF 1531 

CAADOPEN-L AEGTLVIWL MPPEQLLQDA R-SFLRALGT LLHTNLRIKR DSQGELMVYP 1591 

CAEHVPER-L AAGTL-WW LMPPEQLRNS SFHFLRELSR VLHTNWFKR DAHGQQMIFP 1619 

C-mPZN-l AEGTLVLWL MPPERLKNNS V-NFLRELSR VLHTNWFKK DSKGEYKIYP 1615 

CENKTQSPVL AEGAMSWML MNVEAFREIQ A-QFLRNMSH MLRTTVRLKK DALGHDII IN 1650 

TM 

EIDNRQCVQD SDHCFKNTDA AAALLASHAI QG — TLSYP LVSWSESLT PERT-Q-LLY 1680 

EIDNRQCVQA SSQCFQSATD VAAFLGALAS LGSL-NIPYK IEAVQSETVE PPPPAQ-LHF 1737 

EIDNRQCYKS SSQCFNSATD VAAFLGALAS LGSLDTLSYK IEAVKSENME TPKPST-LYP 1730 

EIDNRKCTEC FTHAVEAAEF LAATAAKHQL RNDFQ-IHSV RGIKNPGDED NGEPPANVKY 1745 
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hum N LLAVAW1 IL FI1LLGVIMA KRKRK— HGS LWLPEGFTLR ROASNHKRRE PVGQOAVGLK ^ 

TAN-1 IMYVAAAAFVL LFFVGCGVLL SRKRRROHGQ LWFPEGFKV- SEASKKKRRE ELGEDSVGLK 

Xen N iMLSMLVIPLL I IFVFKWVIV NKKRRREHDS FGSPTALFQK NPA-KRNGET PW-EDSVGLK 

Dros N IVITG1ILVH ALAFFGMVL? - STQRKRAHGV TWFPEGFRAP AAVMSRRRRD PHGQEMRNLN 

CDC-10/Ankyrin Repeots 

hum N PIDRRPWTQQ HLEAA01RRT PSLALTPPOA EOEVDVLDVN VRGPOGCTPL MLASLRGGSS 

TAN-1 QTDHRQWTQQ HLDAAOL-RM SAMAPTPPQG EVDADCMDVN VRGPOGFTPL MIASCSGGGL > 

Xen N KTDPRQWTRQ HLDAADL-RI SSMAPTPPQG E1EADCMDVN VRGPDGFTPL MIASCSGGGL 

Oros N EAOQRVWSOA HLDWDV-R- AIM-TPP-A HQDGGKHD VD ARGPCGLTPL MIAAVRGGGL 

h "n> N ANAQDNMGRC PLHAAVAADA QGVFQILIRN RVTDLOARMM DGTTPLILAA RLAVEGMVAE 

TAN-1 . ANIQDNMGRT PLHAAVSADA QGVFQILIRN RATDLDARMH DGTTPLILAA RLAVEGMLEO 

Xen N :ANVQDNMGRT PLHAAVAADA QGVFQILIRN RATDLDARMF DGTTPLILAA RLAVEGMVEE 

Dros N j ANCQDNTGRT PLHAAVAADA MGVFQILLRN RATNLNARMH DGTTPLILAA RLAIEGMVED > 



r 



-< 



j 



• NLSVQVSEAN L1GTGTSEHW VDDE G PQPKKVKAED EALLSE-EDD 1782 

PLK-NASDGA LMDDNQNE-W GDED LETKKFRFEE PWLPD-LDD 1837 

PIK-NMTDGS FMDDNQNE-W GDEET LENKRFRFEE QVILPELVDD 1831 

KQVAMQSQGV GQPGAH — W SDDESDMPLP KRQRSDPVSG VGLGNNGGYA SDHTMVSEYE 1861 



DLSDEDEDAE DSSANIITDL VYQGASLQAQ TDRTGEMALH LAARYSRADA AKRLLDAGAD 1902 

ETGNSEEE-E DAPA-VISDF IYQGASLHNQ TDRTGETALH LAARYSRSDA AKRLLEASAD 1954 

ETGNSEEE-E DASANMISDF IGQGAQLHNQ TDRTGETALH LAARYARADA AKRLLESSAD 1949 

DTGEDIENNE DSTAQV1SDL LAQGAELNAT MDKTGETSLH LAARFARADA AKRLLDAGAD 1976 

LINCQADVNA VDDHGKSALH WAAAVNNVEA TLLLLKNGAN RDMQDNKEET PLFLAAREGS 2022 

LINSHADVNA VDDLGKSALH WAAAVNNVOA AWLLKNGAN KDMQNNREET PLFLAAREGS 2074 

LINAHADVNA VDEFGKSALH WAAAVNNVOA AAVLLKNSAN KDMQNNKEET SLFLAAREGS 2069 

LITADADINA ADNSGKTALH WAAAVNNTEA VNILLMHHAN RDAQDOKDET PLFLAAREGS 2096 
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hum N 
TAN-1 
Xen N 
Dros N 



YEAAKILLDH FANRDITDHM DRLPRDVARD RMHHDIVRLL DEYNVTPSPP — GTVL— TS 
YETAKVLLDH FANRDITDHM DRLPRDIAQE RMHHDIVRLL DEYNLVRSPQ LHGAPLGGTP 
YETAKVLLDH YANRD1TDHM DRLPRDIAQE RMHHDIVHLL DEYNLVKSPT LHNGPLGAT- 
YEACKALLDN FANREITDHM DRLPRDVASE RLHHDIVRLL DE-HVPRSPQ MLSMTPQAMI 



NLS 



CK II cdc2 



cdc2 



GSRRKKSLSE KVQLSE— SS VTLSPVDSLE iSPHTYVSDTT ISSPM- 
AHRRKKSQDG KGCLLD— SS GML5PVDSLE ISPHGYLSDVA iSPPL- 



A-RRKKSQDC KTTLLDSGSS GVL5PVDSLE [STHGYLSDVS iSPPLJ 

GS-PDNGLDA TGSURRKASS KKTSAASKKA ANLNGLNPGQ LTGGVSGVPG VPPTNSAAQA 
BNTS 



ITSPGILQAS PNPML— ATA APPAPVHAQH 
LRSPF— QOS PSVPLNHLPG MPDTHLGIGH 
MTSPF— iQQS PSMPLNHLTS MPESQLGMNH 



'^YEDCIKNAQS MOSLQGNGLD MIKLDNYAYS MGSPF-HOQE LLNGQGLGMN GNGQRNGVGP 
CK II cdc2 j 



r 



L 



ICGP NRSFLSLKHT PMGKKSRRPS AKSTMPTSLP NLAKEAKDAK 

LCSP NGYLGSLKPG VQGKKVRKPS SKGLACGS KEAKDLK 

ICSP NGYMGNMKPS VQSKKARKPS IKGNGC KEAKELK 

GSPPPGQQOP QLITQPTVIS AGNGGNNGNG NASGKOSNOT AKQKAA KKAKLIE 



ALSPV- 
TLSPP- 
TLSPP- 



AAAAAAAVAA MSHELEGSPV GVGMGGNLPS PYDTSSMYSN AMAAPLANGN PNTGAKQPPS: 



ALSFSNLHEM Q- 



PLAHGASTV LPSVSQLLSH HHIVSPGS- 

LNVAA-KPEM AALGGGGRLA FETGPPRLSH LPVASGTSTV LGSSSGGALN FTVGGSTSLN 
INMAT-KQEM AA— CSNRMA FDAMVPRLTH L-NASSPNTI MS — NGSMH FTVGGAPTMN 
GVLPGGLCGM GGLSGAGNGN SHEQGLSPPY SNQSPPHSVQ SSLALSPHAY LGSPSPAKSR 



2127 
2178 
2170 
2208 

2169 
2219 
2213 
2327 

2235 
2306 
2294 
2445 
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GSAGSLSRLH PVPVPAOW- MNRMEVNETQ YNEMFGMVLA PAEG-THPGI APOSRPPEGK 
GQCEWLSRLG SGMVPNQYNP LRGSVAPGPL STQAPSLQHG -MVGPLHSSL AASALSQWS 
SQCDWLARLO NGMVQNQYDP IRNGIQQGN- AQQAQALQHG LMTS-LHNGL PATTLSQIWT 
PSLPTSPTHI OAMRHATQQK OFGGSNLNSL LGGANGGGW GGGGGGGGGV GQGPQNSPVS 

APQPOSTCPP AVAGPLPTMY Q1P EM ARL-PSVAFP TAMVPQQDGQ VAQTILPAYH 

PPQPHLGVSS AASGHLGRSF LSGEPSQADV QPLGPSSLAV HTILPO-ESP ALPTSLPSSL 
MQQQHHN-SS TTSTHINSPF CSSDISQTDL QQM— SSNNI HSVMP(H)TQ IFAASLPSNL 
QQQLGGLEFG SAGLDLNG-F CGSPOSFHSG QMNPPS — I QSSMSG-SSP STNMLSPSSQ 



' SDWSDVTTSP TRGGAGGGQR GPGTHMSEPPHNN MQVYA 

i SDWSEGVSSP PT SMQ SQIARIPEAFK 

! SDWSEGISSP PT SMQ PQRTHIPEAFK 

; SDWSEGVQSP AANNLYISGG HOANKGSEAIYI 



J 



V 



J 



r 



L 



— < 



— HITTPRE PLPP-IV-TF QLIPKGSIAQ PAG- 



YQGLPSTRL ATQPHLVOTO QVQPONLQMQ QQNLOPANIQ QOQSLQPPPP 

YOAMPNTRL ANQPHLMQAQ QMQQQON LQLHOS 

LGIISPTGSD MGIMLAPPOS SKNSAIMQTI SPQQQQQQQQ QQOOQHQQQQ 



MMHMM 



2320 
2414 
2384 
2565 



PEST -contoinino Region 



PFPASVGKYP iTPPSQHSYAS SNAAERTPSH SGHLQGEHPY LTPSPESPDQ WSSSSPHSA- 2433 

VPPVTAAQFL ITPPSQHSY-S S-PVENTPSH OLOVP-EGPF LTPSPESPDQ WSSSSPHSNV 2530 

TQSMTTAQFL ITPPSQHSY-S S-PMDNTPSH QLQVP-DHPF LTPSPESPDQ WSSSSPHSNM 2497 

HNQQAFYQYL iTPSSQHS GGHTPQH LVQTL-D-SY PTPSPESPGH WSSSSPRSN- 2671 

2471 
2556 
2523 
2703 
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10 20 30 40 50 60 70 80 90 

» » « » t « ♦ » , 

GGAATTCCGC CCGCCCTGOG CCCCGCTCTG CTGTGGGCGC TGCIGGCGCT CTGGCTGTGC TGCGCGGCCC CCGCGCATGC ATTGCAGTGT 
PAIR PAL LWA LLAL WLC CAA PAHA L Q C> 

100 110 120 130 140 150 160 170 180 

* * • * * # * ♦ * 

CGAGATGGCT ATGAACCCTG TGTAAATGAA GGMTGTGTG TTACCTACCA CAATGGCACA GGATACTGCA AATGTCCAGA AGGCTTCTTG 
RDG YEPC VNE GMC VTYH NGT GYC KCPE GFL> 

190 • 200 210 220 230 240 250 260 270 

********* 

GGGGAATATT GTCAACATCG AGACCCCTGT GAGAAGAACC GCTGCCAGAA TGGTGGGACT TGTGTGGCCC AGGCCATGCT GGGGAAAGCC 
GEY CQHR OPC EKN RCQN GGT CVA Q A M L GKA> 

280 290 300 310 320 330 340 350 360 

* * * ****** 

ACGTGCCGAT GTGCCTCAGG GTTTACAGGA GAGGACTGCC AGTACTCAAC ATCICATCCA TGCTTTGTGT CTCGACCCTG CCTGAATGGC 
TCR CASG FTG EOC QYST SHP CFV SRPC LNG> 

370 380 390 400 410 420 430 440 450 

*** ****** 

GGCACATGCC ATATGCTCAG CCGGGATACC TATGAGTGCA CCTGTCAAGT CGGGTTTACA GGTAAGGAGT GCCAATGGAC GGATGCCTCC 
GTC HMLS ROT YEC TCQV GFT GKE CQWT DAO 

460 470 480 490 500 510 520 530 540 
***«>»»»* 

CTGTCTCATC CCTGTGCAAA TGGAAGTACC TGTACCACTG TGGCCAACCA GTTCTCCTGC AAATGCCTCA CAGGCTTCAC AGGGCAGAAA 
LSH PCAN GST CTT VANQ FSC KCL TGFT GQK> 

550 560 570 580 590 600 610 620 630 
*»*»**»*» 
TGTGAGACTG ATGTCAATGA GTGTGACATT CCAGGACACT GCCAGCATGG TGGCACCTGC CTCAACCTGC CTGGTTCCTA CCAGTGCCAG 
CET DVNE CD1 PGH CQHG GTC L N L PGSY QCQ> 

640 650 660 670 680 690 700 710 720 
»»»♦»»♦*» 

TGCCCTCAGG GCTTCACAGG CCAGTACTGT GACAGCCTGT ATGTGCCCTG TGCACCCTCA CCTTGTGTCA ATGGAGGCAC CTGTCGGCAG 
CPQ GFTG QYC DSL YVPC APS PCV NGGT CRQ> 

730 740 750 760 770 780 790 800 810 
*»*♦«««»» 

ACTGGTGACT TCACTTTTGA GTGCAACTGC CTTCCAGGTT TTGAAGGGAG CACCTGTGAG AGGAATATTG ATGACTGCCC TAACCACAGG 
TGD FTFE CNC LPG FEGS TCE R N I ODCP NHR> 
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820 830 840 850 860 870 880 890 900 

* * ♦ » * * * « 4 

TGTCAGAATG GACGGGTTTG TGTGGATGGG GTCAACACTT ACAACTGCCG CTGTCCCCCA CAATGGACAG GACAGTTCTG CACAGAGGAT 
CQN GGVC VDG VNT YNCR CPP OWT GQFC TED> 

910 920 930 940 950 960 970 980 990 
**»*»♦»»» 
GTGGATGAAT GCCTGCTGCA GCCCAATGCC TGTCAAAATG GGGGCACCTG TGCCAACCGC AATGGAGGCT ATGGCTGTGT ATGTGTCAAC 
VOE CLLO PNA CQN GGTC ANR NGG YGCV CVN> 

1000 1010 1020 1030 1040 1050 1060 1070 1080 

* ♦♦♦**♦ * » 
GGCTGGAGTG GAGATGACTG CAGTGAGAAC ATTGATGATT GTCCCTTCGC CTCCTGTACT CCAGGCTCCA CCTGCATCGA CCGTGTGGCC 

GWS GDOC SEN 1DD CAFA SCT PGS TCIO RVA> 

1090 1100 1110 1120 1130 1140 1150 1160 1170 
***«***♦♦ 
TCCTTCTCTT GCATGTGCCC AGAGGGGAAG GCAGGTCTCC TGTGTCATCT GGATGATGCA TGCATCAGCA ATCCTTGCCA CAAGGGGGCA 
SFS CMCP EGK AGL LCHL DDA CIS NPCH KGA> 

1180 1190 1200 1210 1220 1230 1240 1250 1260 

* » * * * « ♦ * « 

CTGTGTGACA CCAACCCCCT AAATGGGCAA TATATTTGCA CCTGCCCACA AGGCTACAAA GGGGCTGACT GCACAGAAGA TGTGGATGAA 
LCD TNPL NGO YIC TCPQ GYK GAD CTED VDE> 

1270 1280 1290 1300 1310 1320 1330 1340 1350 
***»«*♦»♦ 
TGTGCCATGG CCAATAGCAA TCCTTGTGAG CATGCAGGAA AATGTGTGAA CACGGATGGC GCCTTCCACT GTGAGTGTCT GAAGGGTTAT 
CAM ANSN PCE HAG KCVN TOG AFH CECL KGY> 

1360 1370 1380 1390 1400 1410 1420 1430 1440 
♦***♦♦*♦♦ 
GCAGGACCTC GTTGTGAGAT GGACATCAAT GAGTGCCATT CAGACCCCTG CCAGAATGAT GCTACCTGTC TGGATAAGAT TGGAGGCTTC 
AGP RCEM DIN ECH SDPC QND ATC LDKI GGF> 

1450 1460 1470 1480 1490 1500 1510 1520 1530 
♦***♦»»♦« 
ACATGTCTGT GCATGCCAGG TTTCAAAGGT GTGCATTGTG AATTACAAAT AAATGAATGT CAGAGCAACC CTTGTGTGAA CAATGGGCAG 
TCL CMPG FKG VHC ELEI NEC DSN PCVN NGO 

1540 1550 1560 1570 1580 1590 1600 1610 1620 
**♦**«»** 
TGTGTGGATA AAGTCAATCG TTTCCAGTGC CTGTGTCCTC CTGGTTTCAC TGGGCCAGTT TGCCAGATTG ATATTGATGA CTGTTCCACT 
C V D K V N R F Q C L C P P G F T G P V C 0 I D I D D C S S> 
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1630 1640 1650 1660 1670 1680 1690 1700 1710 
*♦««*»»»» 

ACTCCCTGTC TGAATGGGGC AAAGTGTATC GATCACCCGA ATGGCTATGA ATGCCAGTGT GCCACAGGTT TCACTGGTGT GTTGTGTGAG 
TPC LNGA K C I OHP NGYE CQC ATG FTGV LCE> 

1720 1730 1740 1750 1760 1770 1780 1790. 1800 
»*•*»** *» 

GAGAACATTG ACAACIGTGA CCCCGATCCT TGCCACCATG GTCAGTGTCA GGATGGTATT GATTCCTACA CCTGCATCTG CAATCCCGGG 
ENI ONCO POP CHH GOCO 0 G I DSY TCIC NPG> 

1810 1820 1830 1840 1850 1860 1870 1880 1890 

* ♦♦♦♦♦ *» * 
TACATGGGCG CCATCTGCAG TGACCAGATT GATGAATGTT ACAGCAGCCC TTGCCTGAAC GATGGTCGCT GCATTGACCT GGTCAATGGC 
Y M G AICS D 0 I DEC YSSP CLN DGR CIDL VNG> 

1900 1910 1920 1930 1940 1950 1960 1970 1980 
****** » » * 

TACCAGTGCA ACTGCCAGCC AGGCACGTCA GGGGTTAATT GTGAAATTAA TTTTGATGAC TGTGCAAGTA ACCCTTGTAT CCATGGAATC 
YQC NCOP GTS GVN CEIN FDD CAS NPCI HGI> 

1990 2000 2010 2020 2030 2040 2050 2060 2070 

* * * * * * * * * 
TGTATGGATG GCATTAATCG CTACAGTTGT GTCTGCTCAC CAGGATTCAC AGGGCAGAGA TGTAACATTG ACATTGATGA GTGTGCCTCC 
CMD GINR YSC VCS PGFT GOR CNI DIDE CAS> 

2080 2090 2100 2110 2120 2130 2140 2150 2160 
******** * 
AATCCCTGTC GCAAGGGTGC MCATGTATC AACGGTGTGA ATGGTTTCCG CTGTATATGC CCGGAGGGAC CCCATCACCC CAGCTGCTAC 
NPC RKGA TCI NGV NGFR CIC PEG PHHP SCY> 

2170 2180 2190 2200 2210 2220 2230 2240 2250 
******** * 
TCACAGGTGA ACGAATGCCT GACCAATCCC TGCATCCATG GAAACTGTAC TGGAGGTCTC AGTGGATATA AGTGTCTCTG TGATGCAGGC 
SOV NECL SNP CIH GMCT GGL SCY KCLC D A G> 

2260 2270 2280 2290 2300 2310 2320 2330 2340 
*♦**♦♦♦*« 
TGGGTTGGCA TCAACTGTGA AGTGGACAAA AATGAATGCC TTTCGAATCC ATGCCAGAAT GGAGGAACTT GTGACAATCT GGTGAATGGA 
WVG INCE VDK NEC LSNP CQN GGT CONL VNG> 

2350 2360 2370 2380 2390 2400 2410 2420 2430 

* * * * * « » • * 
TACAGGTGTA CTTGCAAGAA GGGCTTTAAA GGCTATAACT GCCAGGTGAA TATTGATGAA TGTGCCTCAA ATCCATGCCT GAACCAAGGA 
YRC TCKF CFK CYN CQVN IDE CAS NPCL NOG) 

FIG.17C 



_9407474A1J_> 



SUBSTITUTE SHEET (RULE 26) 



WO 94/07474 



PCT/LS93/09338 



60/68 

2440 2450 2460 2470 2480 2490 2500 2510 2520 
********* 

ACCTGCTTTG ATGACATAAG TGGCTACACT TGCCACTGTG TGCTGCCATA CACAGGCAAG AATTGTCAGA CAGTATTGGC TCCCTGTTCC 
TCF ODIS GYT CHC VLPY TGK NCQ TVLA PCS> 

2530 2540 2550 2560 2570 2580 • 2590 2600 2610 
********* 

CCAAACCCTT GTGAGAATGC TGCTGTTTGC AAAGAGTCAC CAAATTTTGA GAGTTATACT TGCTTGTGTG CTCCTGGCTG GCAAGGTCAG 
PNP CENA AVC KES PNFE SYT CLC APGW QGQ> 

2620 2630 2640 2650 2660 2670 2680 2690 2700 

********* 

CGGTGTACCA TTGACATTGA CGAGTGTATC TCCAAGCCCT GCATGAACCA TGGTCTCTGC CATAACACCC AGGGCAGCTA CATGTGTGAA 
RCT IDID EC1 SKP CMNH GLC HNT QGSY MCE> 

2710 2720 2730 2740 2750 2760 2770 2780 2790 

********* 

TGTCCACCAG GCTTCAGTGG TATGGACTGT GAGGAGGACA TTGATGACTG CCTTGCCAAT CCTTGCCAGA ATGGAGGTTC CTGTATGGAT 
CPP GFSG MOC EED IDDC LAM PCQ NGGS CMO> 

2800 2810 2820 2830 2840 2850 2860 2870 2880 
********* 

GGAGTGAATA CITTCTCCTG CCTCTGCCTT CCGGGTTTCA CIGGGGATAA GTGCCAGACA" GACATGAATG AGTGTCTGAG TGAACCCTGT 
GVN TFSC LCL PGF TGDK COT DMN ECLS EPD 

2890 2900 2910 2920 2930 2940 2950 2960 2970 
********* 

AAGAATGGAG GGACCTGCTC TGACTACGTC AACAGTTACA CTTGCAAGTG CCAGGCAGGA TTTGATGGAG TCCATTGTGA GAACAACATC 
KNG GTCS DYV NSY TCKC QAG FDG VHCE NNI> 

2980 2990 3000 3010 3020 3030 3040 3050 3060 
********* 

AATGAGTGCA CIGAGAGCTC CTGTTTCAAT GGTGGCACAT GTGTTGATGG GATTAACTCC TTCTCTTGCT TGTGCCCTGT GGGTTTCACT 
NEC TESS CFN GGT CVDG INS FSC LCPV GFT> 

3070 3080 3090 3100 3110 3120 3130 3140 3150 
********* 
GGATCCTTCT GCCTCCATGA GATCAATGAA TGCAGCTCTC ATCCATGCCT GAATGAGGGA ACGTGTGTTG ATGGCCTGGG TACCTACCGC 
GSF CLHE I N E CSS HPCL NEC TCV DGIG TYR> 

3160 3170 . 3180 3190 3200 3210 3220 3230 3240 
* * * * * * * * * 

TGCAGCTGCC CCCTGGGCTA CACTGGGAAA AACTGTCAGA CCCTGGTGAA TCTCTGCAGT CGGTCTCCAT GIAAAAACAA ACGTACT7GT 
CSC PLGY TGK NCO TlVN LCS RSP CKNK G T O 
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3250 3260 3270 3280 3290 3300 3310 3320 3330 
*»»♦♦♦**♦ 
GTTCAGAAAA AAGCAGAGTC CCAGTGCCTA TGTCCATCTG GATGGCCTGG TGCCTATTGT GACGTGCCCA ATGfCTCTTG TGACATAGCA 
VOK KAES QCL CPS GWAG AYC DVP NVSC D i A> 

3340 3350 3360 3370 3380 3390 3400 3410 3420 
******** i 
GCCTCCAGGA GAGGTGTGCT TGTTGAACAC TTCTGCCAGC ACTCAGGTGT CTGCATCAAT GCTGGCAACA CGCATTACTG TCAGTGCCCC 
ASR RGVL VEH LCO HSGV CIN AGN THYC OCP> 

3430 3440 3450 3460 3470 3480 3490 3500 3510 
* * * ♦ * ♦ * * * 

CTGGGCTATA CTGGGAGCTA CTGTGAGGAG CAACTCGATG AGTGTGCGTC CAACCCCTGC CAGCACGGCG CAACATGCAG TGACTTCATT 
LGY TGSY CEE OLD ECAS NPC OHG ATCS DFI> 

3520 3530 3540 3550 3560 3570 3580 3590 3600 
*»*«****♦ 
GGTGGATACA GATGCGAGTG TCTCCCAGGC TATCAGGGTG TCAACTGTGA GTATGAAGTG GATGAGTGCC AGAATCAGCC CTGCCAGAAT 
GGY RCEC VPG YOG VNCE YEV DEC Q N 0 P C Q N> 

3610 3620 3630 3640 3650 3660 3670 3680 3690 
**♦«♦»*»♦ 
GGAGGCACCT GTATTGACCT TGTGAACCAT TTCAAGTGCT CTTGCCCACC AGGCACTCGG GGCCTACTCT GTGAAGAGAA CATTGATGAC 
GGT CIDL VNH FKC SCPP GTR GLL CEEN ID0> 

3700 3710 3720 3730 3740 3750 3760 3770 3780 
**♦»♦»*»» 
TGTGCCCGGG GTCCCCATTG CCTTAATGGT GGTCAGTGCA TGGATAGGAT TGGAGGCTAC AGTTGTCGCT GCTTGCCTGG CTTTGCTGGG 
CAR GPHC LNG GQC MORI GGY SCR CLPG FAC> 

3790 3800 3810 3820 3830 3840 3850 3860 3870 
**♦*♦**»♦ 
GAGCGTTGTG AGGGAGACAT CAACGAGTGC CTCTCCAACC CCTGCAGCTC TGAGGGCAGC CTGGACTGTA TACAGCTCAC CAATGACTAC 
ERC EGDI NEC L S M PCSS EGS LDC IOLT NOY> 

3880 3890 3900 3910 3920 3930 3940 3950 3960 
**♦*♦«*»♦ 
CTGTGTGTTT GCCGTAGTGC CTTTACTGGC CGGCACTGTG AAACCTTCGT CGATGTGTGT CCCCAGATGC CCTGCCTGAA TGGAGGGACT 
LCV CRSA FTG RHC ETFV DVC POM PCLN GGT> 

3970 3980 3990 4000 4010 4020 4030 4040 4050 
****»**»» 
TGTGCTGTGG CCAGTAACAT GCCTGATGGT TTCATTTGCC GTTGTCCCCC GGGATTTTCC GGGGCAAGGT GCCAGAGCAG CTGTGGACAA 
CAV ASNM PDG FIC RCPP GFS GAR COSS CGQ> 
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4060 . 4070 4080 4090 4100 4110 4120 4130 4140 
*♦***«*«* 
GTGAMTGTA GGAAGGGGGA GCAGTGTGTG CACACCGCCT CTGGACCCCG CTCCTTCTGC CCCAGTCCCC GGGACTGCGA GTCAGGCTGT 
VKC RKGE QCV HTA SGPR CFC PSP RDCE SGC> 

4150 4160 4170 4180 4190 4200 4210 4220 4230 
*«**«**♦♦ 
GCCAGTAGCC CCTGCCAGCA CGGGGGCAGC TGCCACCCTC AGCGCCAGCC TCCTTATTAC TCCTGCCAGT GTGCCCCACC ATTCTCGGGT 
ASS PCQH GGS CHP QRQP PYY SCQ CAPP FSG> 

4240 4250 4260 4270 4280 4290 4300 4310 4320 
****♦♦**» 
AGCCGCTGTG AACTCTACAC GGCACCCCCC AGCACCCCTC CTGCCACCTG TCTGACCCAG TATTGTGCCC ACAAAGCTGG GGATGGOGTC 
SRC ELYT APP STP PATC LSQ Y C A DKAR DGV> 

4330 4340 4350 4360 4370 4380 4390 4400 4410 

********* 

TGTGATGAGG CCTGCAACAG CCATGCCTGC CAGTGGGATG GGGGTGACTG rTCTCTCACC ATGGAGAACC CCTGGGCCAA CTGCTCCTCC 
CDE ACNS HAC OWD GGDC SLT MEN PWAN CSS> 

4420 4430 4440 4450 4460 4470 4480 4490 4500 
* ♦ * * ♦ * ♦ » * 

CCACTTCCCT GCTGGGATTA TATCAACAAC CAGTGTGATG AGCTGTGCAA CACGGTCGAG TGCCTGTTTG ACAACTTTGA ATGCCAGGGG 
PLP CWDY INN QCD ELCN TVE CLF DNFE CQG> 

4510 4520 4530 4540 4550 4560 4570 4580 4590 
********* 

AACAGCAAGA CATGCAAGTA TGACAAATAC TGTGCAGACC ACTTCAAAGA CAACCACTGT AACCAGGGGT GCAACAGTGA GGAGTGTGGT 
NSK TCKY OKY CAD HFKD NHC NOG CNSE ECO 

4600 4610 4620 4630 4640 4650 4660 4670 4680 
********* 
TGGGATGGGC TGGACTGTGC TGCTGACCAA CCTGAGAACC TGGCAGAAGG TACCCTGGTT ATTGTGGTAT TGATGCCACC TGAACAACTG 
WOG LOCA A 0 Q PEN LAEG TLV IVV LMPP EQL> 

4690 4700 4710 4720 4730 4740 4750 4760 4770 
********* 
* CTCCAGGATG CTCGCACCTT CTTGCGGGCA CTGGGTACCC TGCICCACAC CAACCTGCGC ATTAAGCGGG ACTCCCAGGG GGAACTCATG 

LOO ARSFLRA LGT LIHT NLR IKR OSOG ELM> 

4780 4790 4800 4810 4820 4830 4840 4850 4860 
********* 
GTGTACCCCT ATTATGGTGA GAAGTCAGCT GCTATGAAGA AACAGAGGAT GACACGCAGA TCCCTTCCTG GTGAACAAGA ACAGGAGGTG 
VYP YYGE KSA AMK KORM TRR SLP CEQE OEV> 
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4870 4880 4890 4900 4910 4920 4930 4940 4950 
******»«, 
GCTCGCTCTA AAGTCTTTCT GGAAATTGAC AACCGCCAGT GTGTTCAAGA CTCAGACCAC TGCTTCAAGA ACACGGATGC AGCAGCAGCT 
AGS KVFl EIO NRQ CVQD SDH CFK N T 0 A A A A> 

4960 4970 4980 4990 5000 5010 5020 5030 5040 

♦ *♦»•»».»» 
CTCCTGGCCT CTCACGCCAT ACAGGGGACC CTGTCATACC CTCTTGTGTC TGTCGTCAGT GAATCCCTGA CTCCAGAACG CACTCAGCTC 

L L A SHAI QGT LSY PLVS VVS ESL TPER TQL> 

5050 5050 5070 5080 5090 5100 5110 5120 5130 
♦****»♦», 
CTCTATCTCC TTGCTGTTGC TGTTGTCATC ATTCTGTTTA TTATTCTGCT GGGGGTAATC ATGGCAAAAC GAAAGCGTAA GCATGGCTCT 
L Y L LAVA V V I I L F MIL G V 1 M A K R K R K H G S> 

5140 5150 5160 5170 5180 5190 5200 5210 5220 
♦**♦♦♦»«» 
CTCTGGCTGC CTGAAGGTTT CACTCTTCGC CGAGAIGCAA GCAATCACAA GCGTGGTGAG CCAGTGGGAC AGGATGCTGT GGGGCTCAAA 
LWL PEGF TLR RDA SNHK RRE PVG ODAV GLK> 

5230 5240 5250 5260 5270 5280 5290 5300 5310 

* * * * * * » « » 
AATCTCTCAG TGCAAGTCTC AGAAGCTAAC CTAATTGGTA CTGGAACAAG TGAACACTGG GTCGATGATG AAGGGCGCCA GCCAAAGAAA 

NLS VOVS EAN LIG TGTS EHW VOD EGPQ PKK> 

5320 5330 5340 5350 5360 5370 5380 5390 5400 
***»*»*«* 
GTAAAGGCTG AAGATGAGGC CTTACTCTCA GAAGAAGATG ACCCCATTGA TCGACGGCCA TGCACACAGC AGCACCTTGA AGCTGCAGAC 
VKA EOEA LLS EEC OPIO RRP WTQ QHLE A A D> 

5410 5420 5430 5440 5450 5460 5470 5480 5490 
******* , , 

ATCCGTAGGA CACCATCGCT GGCTCICACC CCTCCTCAGG CAGAGCAGGA GGTGGATGTG TTAGATGTGA ATGTCCGTGG CCCAGATGGC 
IRR TPSL ALT PPQ AEQE VDV LOV NVRG PDG> 

5500 5510 5520 5530 5540 5550 5560 5570 5580 
***♦♦**♦♦ 
TGCACCCCAT TGATGTTGGC TTCTCTCCGA GGAGGCAGCT CAGATTTGAG TGATGAAGAT GAAGATGCAG AGGACTCTTC TGCTAACATC 
CTP LMLA SLR CCS SDLS DEO EDA EDSS ANI> 

5590 5600 5610 5620 5630 5640 5650 5660 5670 
******»«♦ 
ATCACAGACT TGGTCTACCA GGGTGCCAGC CTCCAGGCCC AGACAGACCG GACTGGTGAG ATGGCCCTGC ACCTTGCAGC COGCTACTCA 
I T 0 L V Y Q GAS L 0 A 0 T D R T G E MAI H L A A R Y S> 
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5680 5690 5700 5710 5720 5730 5740 5750 5760 
********* 
CGGGCTGATG CTGCCAAGOG TCTCCTGGAT GCAGGTGCAG ATGCCAATGC CCAGGACAAC ATGGGCCGCT GTCCACTCCA TGCTGCAGTG 
RAD A A K R LID AC A DANA QDN MGR CPLH AAV> 

5770 5780 5790 5800 5810 5820 5830 5840 5850 
***** * * * * 

GCAGCTGATG CCCAAGGTGT CTTCCAGATT CTGATTCGCA ACCGAGTAAC TGATCTAGAT GCCAGGATGA ATGATGGTAC TACACCCCTG 
A A D AQGV FOI LIR NRVT OLD ARM NDGT TPL> 

5860 5870 5880 5890 5900 5910 5920 5930 5940 
*♦♦»♦♦*** 
ATCCTGGCTG CCCGCCTGGC TGTGGAGGGA ATGGTGGCAG AACTGATCAA CTGCCAAGOG GATGTGAATG CAGTGGATGA CCATGGAAAA 
1 L A ARLA VEG II V A E L I N CQA DVN AVDD HGK> 

5950 5960 5970 5980 5990 6000 6010 602O 6030 
»♦*»***** 
TCTGCTCTTC ACTGGGCAGC TGCTGTCAAT AATGTGGAGG CAACTCTTTT GTTGTTGAAA AATGGGGCCA ACCGAGACAT GCAGGACAAC 
SAL H W A A A V N NVE ATLL L L K NGA NRDM QDN> 

6040 6050 6060 6070 6080 6090 6100 6110 6120 
***** * * * ♦ 

AAGGAAGAGA CACCTCTGTT TCTTGCTGCC CGGGAGGGGA GCTATGAAGC AGCCAAGATC CTGTTAGACC ATTTTGCCAA TCGAGACATC 
KEE TPLF LAA REG SYEA AKI LLD H F A N RDI> 

6130 6140 6150 6160 6170 6180 6190 6200 6210 
* * * * * * * * * 

ACAGACCATA TGGATCGTCT TCCCCGGGAT GTGGCICGGG ATCGCATGCA CCATGACATT GTGCGCCTTC TGGATGAATA CAATGTGACC 
TDH MDRL PRO VARDRMH HDI VRL LDEY NVT> 

6220 6230 6240 6250 6260 6270 6280 6290 6300 
»***«♦*** 
CCAAGCCCTC OCGCACCGT GTTGACTTCT GCTCTCTCAC CTGTCATCIG TGGGCCCAAC AGATCTTTCC TCAGCCTGAA GCACACCCCA 
PSP PGTV LTS ALS PVIC GPN RSF LSLK HTP> 

6310 6320 6340 6350 6360 6370 6380 6390 6400 
***** »*** 
ATGCGCAAGA AGTCTAGACG GCCCAGTGCC AAGAGTACCA TGCCTACTAG CCTCCCTAAC CTTGCCAAGG AGGCAAAGGA TGCCAAGGGT 
MGK KSRR PSA KST MPTS LPN LAK EAKD A K G> 

6400 6410 6420 6430 6440 6450 6460 6470 6480 
********* 

AGTAGGAGGA AGAAGTCTCT GAGTGAGAAG GTCCAACTGT CTGAGAGTTC AGTAACTTTA TCCCCTGTTG ATTCCCTAGA ATCTCCTCAC 
SRR KKSL SEK VOL SESS VTL SPV DSLE SPH> 
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6490 6500 6510 6520 6530 6540 6550 6560 6570 

ACGTATGTTT CCGACACCAC ATCCTCTCCA ATGATTACAT CCCCTGGGAT CTTACAGGCC TCACCCAACC CTATGTTGGC CACTGCCGCC 
T Y V S 0 T T S S P MIT S P G I L 0 A S P N P M L A T A A> 

6580 6590 6600 6610 6620 6630 6640 6650 6660 

CCTCCTGCCC CAGTCCATGC CCAGCATGCA CTATCTTTTT CTAACCTTCA TGAAATGCAG CCTTTGGCAC ATGGGGCCAG CACTGTGCTT 
P P A P V H A Q H A L S P S N L H E M 0 P L A H G A S T V L> 

6670 6680 6690 6700 6710 6720 6730 6740 6750 
***»»**♦♦ 
CCCTCAGTGA GCCAGTTGCT ATCCCACCAC CACATTGTGT CTCCAGGCAG TGGCAGTXT GGAACCTTGA GTAGGCTCCA TCCAGTCCCA 
P S V SQL I S H H HIV S P G S G S A G S L S R L H P V P> 

6760 6770 6780 6790 6800 6810 6820 6830 6840 
♦***♦«♦«, 
GTCCCAGCAG ATTGGATGAA CCGCATGGAG GTGAATGAGA CCCAGTACAA TGAGATGTTT GGTATGGTCC TGGCTCCAGC TGAGGGCACC 
V P A 0 W M N R U E V N E T 0 Y N E M F GUV L A P A E G T> 

6850 6860 6870 6880 6890 6900 6910 6920 6930 
**♦»♦♦*♦« 

CATCCTGGCA TAGCTCCCCA GAGCAGGCCA CCTGAAGGGA AGCACATAAC CACCCCTCGG GAGCCCTTGC CCCCCATTGT GACTTTCCAG 
HPG 1APQ SRP PEG K H I T TPR EPL PPIV TFQ> 

6940 6950 6960 6970 6980 6990 7000 7010 7020 
***♦*♦♦», 
CTCATCCCTA AAGGCAGTAT TGCCCAACCA GCCGGGGCTC CCCAGCCTCA GTCCACCTGC CCTCCAGCTG TTGCCGGCCC CCTGCCCACC 
L ' p KGSI AQP AGA POPO STC PPA VAGP LPT> 

7030 7040 7050 7060 7070 7080 7090 7100 7110 
♦ ♦ » ♦ , , , # , 

ATGTACCAGA TTCCAGAAAT GGOCCGTTTG CCCAGTGTGG CTTTCCCCAC TGCCATGATG CCCCAGCAGG ACGGGCAGGT AGCTCAGACC 
MY Q IPEM ARL PSV AFPT AMM PQQ DGQV AQT> 

7120 7130 7140 7150 7160 7170 7180 7190 7200 

ATTCTCCCAG CCTATCATCC TTTCCCAGCC TCTGTGGGCA AGTACCCCAC ACCCCCTTCA CAGCACAGTT ATGCTTCCTC AAATGCTGCT 
1 L P A Y H P F P A SVG K Y P T P P S 0 H S Y A S S N A A> 

7210 7220 7230 7240 7250 7260 7270 7280 7290 

GAGCGAACAC CCAGTCACAG TGGTCACCTC CAGGGTGAGC ATCCCTACCT GACACCATCC CCAGAGTCTC CTGACCAGTG GTCAAGTTCA 
ERT PSHS GHL QGE HPYL TPS PES POQW SSS> 



FIG. 171 



9407474A1_I_: 



SUBSTITUTE SHEET (RULE 26) 



WO 94/07474 



PCT/US93/09338 



66/68 

7300 7310 7320 7330 7340 7350 7360 7370 7380 
»**»«»*** 
TCACCCCACT CTGCTTCTGA CTGGTCAGAT GTGACCACCA GCCCTACCCC TGGGGGTGCT GGAGGAGGTC AGCGGGGACC TGGGACACAC 
SPH SASD If S D VTT SPTP GGA GGG QRGP GTH> 

7390 7400 7410 7420 7430 7440 7450 7460 7470 
*«***«**♦ 
ATGTCTGAGC CACCACACAA CAACATGCAG GTTTATGOGT GAGAGAGTCC ACCTCCAGTG TAGAGACATA ACTGACTTTT GTAAATGCTG 
MSE PPHN NMQ VYA> 

7480 7490 7500 7510 7520 7530 7540 7550 7560 
********* 
CTGAGGAACA AATGAAGGTC ATCCGGGAGA GAAATGAAGA AATCTCTGGA GCCAGCTTCT AGAGGTAGGA AAGAGAAGAT GTTCTTATTC 

7570 7580 7590 7600 7610 7620 7630 7640 7650 
********* 
AGATAATGCA AGAGAAGCAA TTCGTCAGTT TCACTGGGTA TCTGCAAGGC TTATTGATTA TTCTAATCTA ATAAGACAAG TTTGTGGAAA 

' 7660 7670 7680 7690 7700 7710 7720 7730 7740 
********* 
TGCAAGATGA ATACAAGCCT TGGGTCCATG TTTACTCTCT TCTATTTGGA GAATAAGATG GATGCTTATT GAAGCCCAGA CATTCTTGCA 

7750 7760 7770 7780 7790 7800 7810 7820 7830 
********* 
GCTTGGACTG CATTTTAAGC CCTGCAGGCT TCTGCCATAT CCATGAGAAG ATTCTACACT AGCGTCCTGT TGGGAATTAT GCCCTGGAAT 

7840 7850 7860 7870 7880 7890 7900 7910 7920 
********* 
TCTGCCTGAA TTGACCTACG CATCTCCTCC TCCTTGGACA TTCTTTTGTC TTCATTTGGT GCTTTTGGTT TTGCACCTCT CCGTGATTGT 

7930 7940 7950 7960 7970 7980 7990 8000 8010 

* *»♦*»*«* 
AGCCCTACCA GCATGTTATA GGGCAAGACC TTTGTGCTTT TGATCATTCT GGCCCATGAA AGCAACTTTG GTCTCCTTTC CCCTCCTGTC 

8020 8030 8040 8050 8060 8070 8080 8090 8100 

* «♦****** 
TTCCGGGTAT CCCTTGGAGT CTCACAAGGT TTACTTTGGT ATGGTTCICA GCACAAACCT TTCAAGTATG TTGTTTCTTT GGAAAATGGA 

8110 8120 8130 8140 8150 8160 8170 8180 8190 

* ♦****♦*♦ 
CATACTGTAT TGTGTTCTCC TGCATATATC ATTCCTGGAG AGAGAAGGGG AGAAGAA1AC TTTTCTTCAA CAAATTTTGG GGGCAGGAGA 

8200 8210 8220 8230 8240 8250 8260 8270 8280 

* * * * * ♦ * * » 
TCCCTTCAAG AGGCTGCACC TTAATTTTTC TTGTCTGTGT GCAGGTCTTC ATATAAACTT TACCAGGAAG AAGGGTGTGA GTTTGTTGTT 
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8290 8300 8310 8320 8330 8340 8350 8360 8370 
*»****»«, 
TTTCTGTGTA TGGGCCTGGT CAGTGTAAAG TTTTATCCTT GATAGTCTAG TTACTATGAC CCTCCCCACI TTTTTAAAAC CAGAAAAAGG 

8380 8390 8400 8410 8420 8430 8440 8450 8460 
*»*»»♦*», 
TTTGGAATGT TGGAATGACC AAGAGACAAG TTAACTCGTG CAAGAGCCAG TTACCCACCC ACAGGTCCCC CTACTTCCTG CCAAGCATTC 

8470 8480 8490 8500 8510 8520 8530 8540 8550 
***«*«»»» 
CATTGACTGC CTGTATGGAA CACATTTGTC CCAGATCTGA GCATTCTAGG CCTGTTTCAC TCACTCACCC AGCATATGAA ACTAGTCTTA 

8560 8570 8580 8590 8600 8610 8620 8630 8640 
***»*♦**» 
ACTGTIGAGC CTTTCCTTTC ATATCCACAG AAGACACTGT CTCAAATGTT GTACCCTTGC CATTTAGGAC TGAACTTTCC TTAGCCCAAG 

8650 8660 8670 8680 8690 8700 8710 8720 8730 
♦ ♦ ♦ ♦ * * * ♦ « 

GGACCCAGTG ACAGTTGTCT TCCGTTTGTC AGATCATCAC TCTCTACTGA TTATCTTGCT GCTTAAAGGC CTGCTCACCA ATCTTTCTTT 

8740 8750 8760 8770 8780 8790 8800 8810 8820 
*♦**»»*♦♦ 
CACACCGTGT GGTCCGTGTT ACTGGTATAC CCAGTATGTT CTCACTGAAG ACATGGACTT TATATGTTCA AGTGCAGGAA TTGGAAAGTT 

8830 8840 8850 8860 8870 8880 8890 8900 8910 
•*•♦♦«♦»♦ 
GGACTTGTTT TCTATGATCC AAAACAGCCC TATAAGAAGG TTGGAAAAGG AGGAACTATA TAGCAGCCTT TGCTATTTTC TGCTACCATT 

8920 8930 8940 8950 8960 8970 8980 8990 9000 
♦♦♦**«♦»« 
TCTTTTCCTC TGAAGCGGCC ATGACATTCC CTTTGGCAAC TAACGTAGAA ACTCAACAGA ACATTTTCCT TTCCTAGAGT CACCTTTTAG 

9010 9020 9030 9040 9050 9060 9070 9080 9090 
**♦***♦»« 
ATGATAATGG ACAACTATAG ACTTGCTCAT TGTTCAGACT GATTGCCCCT CACCTGAATC CACTCTCTGT ATTCATCCTC TTGGCAATTT 

9100 9110 9120 9130 9140 9150 9160 9170 9180 
****♦♦♦»« 
CTTTCACTn CTTTTAAGGG CAGAAGCATT TTAGTTAATT GTAGATAAAG AATAGTTTTC TTCCTCTTCT CCTTGGCCCA GTTAATAATT 

9190 9200 9210 9220 9230 9240 9250 9260 9270 
♦****»♦»« 
GCTCCATGGC TACACTGCAA CTTCCGTCCA GTGCTGTGAT GCCCATGACA CCTGCAAAAT AAGTTCTGCC TGGGCATTTT GTAGATATTA 
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9280 9290 9300 9310 9320 9330 9340 9350 9350 
ACAGGTGAAT TCCCGACTCT TTTGGTTTGA ATGACAGTTC TCATTCCTTC TATGGCTGCA AGTATGCATC AGTGCTTCCC ACTTACCTGA 

9370 9380 9390 9400 9410 9420 9430 9440 9450 
TTTGTCTGTC GGTGGCCCcI TATGGAAACC CTGCGTGTCT GTTGGCATAA TACTTTACAA ATGGTTTTTT CAGTCCTATC CAAAT7TATT 

9460 9470 9480 9490 9500 9510 9520 9530 9540 
GAACCAACAA AAATAATTAC TTCTGCCCTG AGATAAGCAG ATTAAGTTTG TTCATTCTCT GCTTTATTCT CTCCATGTGG CAACATTCTG 

9550 9560 9570 9580 9590 9600 9610 9620 9630 
TCAGCCTCTT TCATAGTGTG CAAACATTTT ATCATTCTAA ATGGTGACTC TCTGCCCTTG GACCCATTTA TTATTCACAG ATGGGGAGAA 

9640 9650 9660 9670 9680 9690 9700 9710 9720 
CCTATCTGcI TGGACCCTcl CCATCCTCTG TGCAGCACaJ ACAGTGCAG^ CAGCCAGTG^ CGATGGCGAT GACTTTCTTC CCCTGGGAAT 
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